Information processing apparatus, portable device, electronic pet apparatus, recording medium storing information processing procedures and information processing method

ABSTRACT

In an information processing apparatus, a portable device, an electronic pet apparatus, a recording medium storing information processing procedures and an information processing method, various kinds of data are transmitted via a network and, in addition, words can be cataloged via voice. Further, various responses are made in accordance with user authentication, and voice inputs and responses are classified into categories which are used as a basis for generating a response. Furthermore, the emotion of the electronic pet can be changed on the basis of a past history.

TECHNICAL FIELD

The present invention relates to an information processing apparatus, a portable device, an electronic pet apparatus, a recording medium storing information processing procedures and an information processing method, and can be applied to various kinds of information devices such as mobile telephones and personal computers. By exchanging various kinds of data required in generation of a response via a network and by using voice to catalog words, the present invention realizes a more familiar electronic pet apparatus, an information processing apparatus with an electronic pet, a portable device and a recording medium storing information processing procedures.

BACKGROUND ART

For conventional personal computers, so-called rearing simulation game software has been proposed. The rearing simulation game software is a game to rear a pet (that is, an electronic pet) in a virtual reality space provided by a computer. The pet rearing simulation game software allows easier communication with an electronic pet in comparison with really rearing a pet.

By the way, a real pet performs various kinds of action depending on its physical condition, the surrounding environment and so on. In addition, the pet recognizes its owner and performs actions different from actions to others. Moreover, its behavior may be changed by learning.

If an electronic pet is capable of imitating a variety of behaviors of a real pet, the electronic pet can be considered to be more familiar.

DISCLOSURE OF INVENTION

It is an object of the present invention, addressing the problems described above, to provide a more familiar electronic pet apparatus, an information processing apparatus with an electronic pet, a portable device, a recording medium storing information processing procedures and an information processing method.

In order to solve the problems described above, the present invention is applied to an information processing apparatus, a portable device or an electronic pet apparatus having: a voice recognition means for outputting a result of voice recognition in conformity with a predetermined recognition rule; an emotion generation means for generating an emotion parameter, which varies at least in accordance with the result of voice recognition and the lapse of time and indicates an emotion in a pseudo manner, in conformity with a predetermined emotion-parameter generation rule; and a response generation means for generating a response to the result of voice recognition in conformity with a predetermined response generation rule based on at least the emotion parameter. The following means is included: a communication means for carrying out processing to update the recognition rule, the emotion-parameter generation rule and the response generation rule by connection to a predetermined network; or a communication means for carrying out processing to update data required for the recognition rule, the emotion-parameter generation rule and the response generation rule by connection to the predetermined network.

In addition, the present invention is applied to an information processing apparatus, a portable device or an electronic pet apparatus, and also includes a communication means for acquiring at least the emotion parameter or data required in generation of the emotion parameter by connection to a predetermined network, wherein the response generation means generates a response depending on the emotion parameter acquired by the communication means or a response depending on an emotion parameter generated from the data acquired by the communication means.

Furthermore, the present invention also provides a recording medium storing information processing procedures including: communication processing to execute a process to update the recognition rule, the emotion-parameter generation rule or the response generation rule by connection to a predetermined network; or communication processing to execute a process to update data required for the recognition rule, the emotion-parameter generation rule or the response generation rule by connection to the predetermined network.

Moreover, the present invention also provides a recording medium storing information processing procedures including: communication processing to acquire at least the emotion parameter or data required in generation of the emotion parameter by connection to a predetermined network from equipment of the same type connected to the network; and response generation processing to generate a response depending on the emotion parameter acquired by the communication processing or a response depending on an emotion parameter generated from the data acquired by the communication processing.

Further, the present invention is applied to an information processing method and comprises: communication processing to execute a process to update the recognition rule, the emotion-parameter generation rule or the response generation rule by connection to a predetermined network; or communication processing to execute a process to update data required for the recognition rule, the emotion-parameter generation rule or the response generation rule by connection to the predetermined network.

In addition, the present invention is applied to an information processing method and comprises: communication processing to acquire at least the emotion parameter or data required in generation of the emotion parameter by connection to a predetermined network; and response generation processing to output a response depending on the emotion parameter acquired by the communication processing or a response depending on an emotion parameter generated from the data acquired by the communication processing.

Moreover, the present invention is applied to an information processing apparatus, a portable device or an electronic pet apparatus, and includes a cataloging means capable of changing a data base via voice on the basis of a result of voice recognition by at least cataloging a word obtained as a result of voice recognition in the data base in a cataloging operation mode.

Furthermore, the present invention also provides a recording medium storing information processing procedures including cataloging processing capable of changing a data base via voice on the basis of a result of voice recognition by at least cataloging a word obtained as a result of voice recognition in the data base in a cataloging operation mode.

Furthermore, the present invention is applied to an information processing method and comprises cataloging processing capable of changing a data base via voice on the basis of a result of voice recognition by at least cataloging a word obtained as a result of voice recognition in the data base in a cataloging operation mode.

In addition, as an application to an information processing apparatus, a portable device or an electronic pet apparatus, the present invention has a user authentication means for authenticating the user on the basis of voice, wherein the response generation means changes a response in accordance with the user and in dependence on a result of authentication output by the user authentication means.

Furthermore, the present invention also provides a recording medium storing information processing procedures including user authentication processing of authenticating the user on the basis of voice and response generation processing of changing a response in accordance with the user and in dependence on a result of authentication output by the user authentication processing.

Furthermore, as an application to an information processing method, the present invention comprises user authentication processing of authenticating the user on the basis of voice and response generation processing of changing a response in accordance with the user and in dependence on a result of authentication output by the user authentication processing.

In addition, as an application to an information processing apparatus, a portable device or an electronic pet apparatus, the present invention has a word/phrase classification means for identifying the type of an input expressed by voice in generation of a response to a result of voice recognition, wherein a response generation rule is set as a rule for generating responses, excluding a response of a predetermined type, in accordance with the type of the voice input and on the basis of classification of responses according to classification of voice inputs.

Furthermore, the present invention also provides a recording medium storing information processing procedures prescribing word/phrase classification processing to identify, on the basis of the voice input, the type of an input expressed by voice in generation of a response to a result of voice recognition, wherein a response generation rule is set as a rule for generating responses, excluding a response of a predetermined type, in accordance with the type of the voice input and on the basis of classification of responses according to classification of voice inputs.

Further, as an application to an information processing method, the present invention comprises an information processing procedure for recognizing the type of a voice input and generating a response to the result of voice recognition in accordance with a predetermined response generation rule, which is a rule for generating responses excluding a response of a predetermined type in accordance with the type of an input and a category of a response to the input.

In addition, as an application to an information processing apparatus, a portable device or an electronic pet apparatus, the present invention has a history recording means for recording a history of at least results of voice recognition and emotion parameters corresponding to results of voice recognition, wherein a change in emotion parameter corresponding to a result of voice recognition is varied in accordance with the history.

Furthermore, the present invention also provides a recording medium storing information processing procedures prescribing history recording processing to record a history of at least results of voice recognition and emotion parameters corresponding to results of voice recognition so as to vary a change in emotion parameter corresponding to a result of voice recognition in accordance with the history.

On top of that, as an application to an information processing method, the present invention comprises history recording processing to record a history of at least results of voice recognition and emotion parameters corresponding to results of voice recognition so as to vary a change in emotion parameter corresponding to a result of voice recognition in accordance with the history.

In addition, as an application to an information processing apparatus, a portable device or an electronic pet apparatus, the present invention relates to: a voice recognition means for processing a voice and outputting a result of voice recognition in conformity with a predetermined recognition rule; an emotion generation means for generating an emotion parameter, which indicates an emotion in a pseudo manner and varies at least in accordance with the result of voice recognition and with the lapse of time, in conformity with a predetermined emotion-parameter generation rule; and a response generation means for generating a response to a result of voice recognition in conformity with a predetermined response generation rule based on at least the emotion parameter, wherein there is included: a communication means for carrying out processing to update the recognition rule, the emotion-parameter generation rule and the response generation rule by connection to a predetermined network; or a communication means for carrying out processing to update data required for the recognition rule, the emotion-parameter generation rule and the response generation rule by connection to the predetermined network.

Accordingly, the communication means is capable of outputting various kinds of data required in the generation of a response. Thus, equipment of the same type connected to the network is capable of generating almost the same response as a response to a voice input in this information processing apparatus, the portable device or the electronic pet apparatus. As a result, an electronic pet can be treated as if the electronic pet were taken out to the external equipment connected to the network and, moreover, the electronic pet can also be made easy to get acquainted with as if the electronic pet were a real pet in the course of actual training.

In addition, as an application to an information processing apparatus, a portable device or an electronic pet apparatus, the present invention includes a communication means for acquiring at least an emotion parameter or data required in generation of an emotion parameter by connection to a predetermined network, wherein the response generation means generates a response depending on the emotion parameter acquired by the communication means or a response depending on an emotion parameter generated from the data acquired by the communication means. Thus, the response generation means is capable of generating almost the same response as a response to a voice input in equipment of the same type connected to the network. As a result, an electronic pet can be treated as if the electronic pet were taken out from the equipment of the same type connected to the network and, moreover, the electronic pet can be made easy to get acquainted with as if the electronic pet were a real pet in the course of actual training. In addition, the amount of knowledge can be enlarged if necessary, typically by making the vocabulary of words that can be understood by the electronic pet larger.

Furthermore, the present invention also provides a recording medium storing information processing procedures prescribing: communication processing to execute a process to update the recognition rule, the emotion-parameter generation rule or the response generation rule by connection to a predetermined network; or communication processing to execute a process to update data required for the recognition rule, the emotion-parameter generation rule or the response generation rule by connection to the predetermined network.

Thus, equipment of the same type connected to the network is capable of generating almost the same response as a response to a voice input in an apparatus executing the information processing procedures stored in this recording medium. As a result, an electronic pet can be treated as if the electronic pet were taken out to the external equipment and, furthermore, the electronic pet can be made easy to get acquainted with as if the electronic pet were a real pet in the course of actual training.

Moreover, the present invention also provides a recording medium storing information processing procedures prescribing: communication processing to acquire at least an emotion parameter or data required in generation of an emotion parameter by connection to a predetermined network; and response generation processing to generate a response depending on the emotion parameter acquired by the communication processing or a response depending on an emotion parameter generated from the data acquired by the communication processing.

Thus, an apparatus executing the information processing procedures stored in this recording medium is capable of generating almost the same response as a response to a voice input in the equipment of the same type connected to the network. As a result, an electronic pet can be treated as if the electronic pet were taken out from the equipment of the same type connected to the network and, moreover, the electronic pet can be made easy to get acquainted with as if the electronic pet were a real pet in the course of actual training. In addition, the amount of knowledge can be enlarged if necessary, typically by making the vocabulary of words that can be understood by the electronic pet larger.

On top of that, as an application to an information processing method, the present invention comprises: communication processing to execute a process to update the recognition rule, the emotion-parameter generation rule or the response generation rule by connection to a predetermined network; or communication processing to execute a process to update data required for the recognition rule, the emotion-parameter generation rule or the response generation rule by connection to the predetermined network.

Thus, equipment of the same type connected to the network is capable of generating almost the same response as a response to a voice input in an apparatus executing the information processing method. As a result, an electronic pet can be treated as if the electronic pet were taken out to the external equipment and, furthermore, the electronic pet can be made easy to get acquainted with as if the electronic pet were a real pet in the course of actual training.

In addition, as an application to an information processing method, the present invention comprises: communication processing to acquire at least the emotion parameter or data required in generation of the emotion parameter by connection to a predetermined network; and response generation processing to generate a response depending on the emotion parameter acquired by the communication processing or a response depending on an emotion parameter generated from the data acquired by the communication processing.

Thus, an apparatus executing this information processing method is capable of generating almost the same response as a response to a voice input in the equipment of the same type connected to the network. As a result, an electronic pet can be treated as if the electronic pet were taken out from the equipment connected to the network and, moreover, the electronic pet can be made easy to get acquainted with as if the electronic pet were a real pet in the course of actual training. In addition, the amount of knowledge can be enlarged if necessary, typically by making the vocabulary of words that can be understood by the electronic pet larger.

Moreover, as an application to an information processing apparatus, a portable device or an electronic pet apparatus, the present invention has a cataloging means capable of changing a data base in accordance with a voice input in a cataloging operation mode based on a result of voice recognition by at least cataloging a word obtained as a result of voice recognition in the data base. Thus, the vocabulary of words that can be understood by an electronic pet can be made larger with ease by voice inputs. As a result, the electronic pet can be made easy to get acquainted with as if the electronic pet were a real pet in the course of actual training.

Furthermore, the present invention also provides a recording medium storing information processing procedures prescribing cataloging processing capable of changing a data base in accordance with a voice input in a cataloging operation mode based on a result of voice recognition by at least cataloging a word obtained as a result of voice recognition in the data base.

Thus, the vocabulary of words that can be understood by an electronic pet can be made larger with ease by voice inputs in an apparatus executing the information processing procedures stored in this recording medium. As a result, the electronic pet can be made easy to get acquainted with as if the electronic pet were a real pet in the course of actual training.

On top of that, as an application to an information processing method, the present invention comprises cataloging processing capable of changing a data base in accordance with a voice input in a cataloging operation mode based on a result of voice recognition by at least cataloging a word obtained as a result of voice recognition in the data base. By executing this information processing method, the vocabulary of words that can be understood by an electronic pet can thus be made larger with ease by voice inputs. As a result, the electronic pet can be made easy to get acquainted with as if the electronic pet were a real pet in the course of actual training.

In addition, as an application to an information processing apparatus, a portable device or an electronic pet apparatus, the present invention has a user authentication means for authenticating the user, wherein the response generation means changes a generated response in accordance with the user and in dependence on a result of authentication output by the user authentication means. Thus, the response of an electronic pet to the owner can be made different, for example, from that to a person other than the owner. As a result, the electronic pet can be made a pet which is easier to get acquainted with and behaves as if the electronic pet were a real pet.

Furthermore, the present invention also provides a recording medium storing information processing procedures prescribing user authentication processing of authenticating the user and response generation processing of changing a generated response in accordance with the user and in dependence on a result of authentication output by the user authentication processing. Thus, the response of an electronic pet to the owner can be made different, for example, from that to a person other than the owner. As a result, the electronic pet can be made a pet which is easier to get acquainted with and behaves as if the electronic pet were a real pet.

On top of that, as an application to an information processing method, the present invention comprises user authentication processing of authenticating the user and response generation processing of changing a generated response in accordance with the user and in dependence on a result of authentication output by the user authentication processing. Thus, the response of an electronic pet to the owner can be made different, for example, from that to a person other than the owner. As a result, the electronic pet can be made a pet which is easier to get acquainted with and behaves as if the electronic pet were a real pet.

In addition, as an application to an information processing apparatus, a portable device or an electronic pet apparatus, the present invention has a word/phrase classification means for identifying the type of an input expressed by voice in generation of a response to a result of voice recognition, wherein a response generation rule is set as a rule for generating responses, excluding a response of a predetermined type, in accordance with the type of the voice input and on the basis of classification of responses according to classification of voice inputs. It is thus possible to prevent an electronic pet from outputting an unnatural response such as a question raised in response to an inquiry. As a result, the response of the electronic pet can be made natural as well as lively. In addition, the electronic pet can be made easier to get acquainted with.

Furthermore, the present invention also provides a recording medium storing information processing procedures prescribing word/phrase classification processing to identify the type of an input expressed by voice in generation of a response to a result of voice recognition, wherein a response generation rule is set as a rule for generating responses, excluding a response of a predetermined type, in accordance with the type of the voice input and on the basis of classification of responses according to classification of voice inputs. It is thus possible to prevent an electronic pet from outputting an unnatural response such as a question raised in response to an inquiry. As a result, the response of the electronic pet can be made natural as well as lively. In addition, the electronic pet can be made easier to get acquainted with.

On top of that, as an application to an information processing method, the present invention comprises an information processing procedure for recognizing the type of a voice input and generating a response to the result of voice recognition in accordance with a predetermined response generation rule, which is a rule for generating responses excluding a response of a predetermined type in accordance with the type of an input and a category of a response to the input. It is thus possible to prevent an electronic pet from outputting an unnatural response such as a question raised in response to an inquiry. As a result, the response of the electronic pet can be made natural as well as lively. In addition, the electronic pet can be made easier to get acquainted with.

In addition, as an application to an information processing apparatus, a portable device or an electronic pet apparatus, the present invention has a history recording means for recording a history of at least results of voice recognition and emotion parameters corresponding to results of voice recognition, wherein a change in emotion parameter corresponding to a result of voice recognition is varied in accordance with the history. It is thus possible to create an electronic pet's response full of emotions of familiarity, intimacy and the like to, for example, a voice heard frequently. As a result, the response of the electronic pet can be made natural as well as lively. In addition, the electronic pet can be made easier to get acquainted with.

Furthermore, the present invention also provides a recording medium storing information processing procedures prescribing history recording processing to record a history of at least results of voice recognition and emotion parameters corresponding to results of voice recognition so as to vary a change in emotion parameter corresponding to a result of voice recognition in accordance with the history. It is thus possible to create an electronic pet's response full of emotions of familiarity, intimacy and the like to, for example, a voice heard frequently. As a result, the response of the electronic pet can be made natural as well as lively. In addition, the electronic pet can be made easier to get acquainted with.

On top of that, as an application to an information processing method, the present invention comprises history recording processing to record a history of at least results of voice recognition and emotion parameters corresponding to results of voice recognition so as to vary a change in emotion parameter corresponding to a result of voice recognition in accordance with the history. It is thus possible to create an electronic pet's response full of emotions of familiarity, intimacy and the like to, for example, a voice heard frequently. As a result, the response of the electronic pet can be made natural as well as lively. In addition, the electronic pet can be made more familiar.

BRIEF DESCRIPTION OF DRAWINGS

FIG. 1 is a functional block diagram showing an electronic pet apparatus as implemented by an embodiment of the present invention.

FIG. 2 is a diagram showing a front view of the electronic pet apparatus shown in FIG. 1.

FIG. 3 is a hardware block diagram showing the electronic pet apparatus of FIG. 1.

FIG. 4 shows a table of data representing the physical condition.

FIG. 5 shows a table of data representing an emotion.

FIG. 6 shows a table of character data.

FIG. 7 shows a table of data representing a changed emotion.

FIG. 8 shows rules described in pattern data.

FIG. 9 shows a table of files each containing voice data.

FIG. 10 shows a table of files each containing picture data.

FIG. 11 shows a flowchart representing a connection processing procedure for connecting the electronic pet apparatus to a network.

FIG. 12 is a diagram showing the format of data output to the network.

FIG. 13 is a functional block diagram showing the electronic pet apparatus in more detail in an operation to catalog recognition data.

FIG. 14 is a diagram showing syntax of a voice input subjected to a voice recognition process.

FIG. 15 shows a flowchart representing a processing procedure for cataloging recognition data.

FIG. 16 is a functional block diagram showing the electronic pet apparatus in an operation to authenticate the user in more detail.

FIG. 17 shows rules of pattern data.

FIG. 18 shows a typical dialog to learn a favorite of the user to be used later in authentication of the user.

FIG. 19 shows a typical dialog to authenticate the user by using the favorite obtained during the conversation shown in FIG. 18.

FIG. 20 is a functional block diagram showing the electronic pet apparatus in processing to classify a word or a phrase in more detail.

FIG. 21 shows rules of pattern data for creation of responses to a variety of categories each obtained as a result of the processing to classify a word or a phrase shown in FIG. 20.

FIG. 22 shows a typical conversation history.

FIG. 23 shows another typical conversation history.

FIG. 24 is a functional block diagram showing the electronic pet apparatus in execution of emotion control in more detail.

FIG. 25 is a table of variations in emotions (character data) for different keywords each included in a user voice input.

FIG. 26 is a table summarizing the variations in emotions (character data) shown in FIG. 25.

FIG. 27 is a table summarizing changed variations in emotions (character data).

FIG. 28 is a hardware block diagram showing an electronic pet apparatus as implemented by another embodiment of the present invention.

FIG. 29 is a diagram showing a front view of a portable telephone.

BEST MODE FOR CARRYING OUT THE INVENTION

1. First Embodiment

1-1. Overall Configuration of the First Embodiment

FIG. 2 is a diagram showing a front view of an electronic pet apparatus 1 implemented by a first embodiment of the present invention. As shown in the figure, the electronic pet apparatus 1 includes an antenna 2 which can be pulled out upward and a liquid-crystal display panel 3 on the upper portion of the front surface. The liquid-crystal display panel 3 employed in the electronic pet apparatus 1 displays the figure of an electronic pet and a message issued by the electronic pet. Under the liquid-crystal display panel 3, the electronic pet apparatus 1 includes a confirm operator 4A, a cancel operator 4B and a cursor operator 5. These operators are operated to change the operating mode and to accomplish other purposes.

The electronic pet apparatus 1 further includes a speaker 6 and a microphone 7 beneath the confirm and cancel operators 4A and 4B respectively. A conversation can be held with the electronic pet through the speaker 6 and the microphone 7. Furthermore, the electronic pet apparatus 1 has a socket on the rear surface. The socket allows an IC card 8 to be mounted on the electronic pet apparatus 1.

FIG. 3 is a block diagram showing hardware of the electronic pet apparatus 1. As shown in the figure, the electronic pet apparatus 1 includes an analog-to-digital (A/D) conversion circuit 10 for converting an audio analog signal coming from the microphone 7 by way of an amplifier circuit not shown in the figure into digital audio data DA. The analog-to-digital conversion circuit 10 outputs the digital audio data DA to a central processing unit (CPU) 11. In this way, the electronic pet apparatus 1 is capable of processing a voice entered by the user by using the central processing unit 11.

On the other hand, a digital-to-analog (D/A) conversion circuit 12 converts digital audio data DB produced by the central processing unit 11 into an analog audio signal which is output to the speaker 6. In this way, the user is capable of verifying a voice of the electronic pet generated by the electronic pet apparatus 1 to express a response generated by the electronic pet.

Controlled by the central processing unit 11, a monitor interface (monitor I/F) 13 drives the liquid-crystal display panel 3 to display a picture of the electronic pet on the liquid-crystal display panel 3 in accordance with picture data DV coming from the central processing unit 11 by way of a bus.

A key interface (key I/F) 14 detects an operation carried out by the user on the operator 4A, 4B or 5, supplying a detection signal to the central processing unit 11. A read-only memory (ROM) 15 is used for storing information such as a processing program to be executed by the central processing unit 11 and various kinds of data necessary for an analysis of a voice acquired through the microphone 7. Information is read out from the read-only memory 15 under control executed by the central processing unit 11. A random-access memory (RAM) 16 serves as a work area of the central processing unit 11. The random-access memory 16 is used for temporarily storing various kinds of data necessary for processing carried out by the central processing unit 11.

Controlled by the central processing unit 11, a network connection unit 17 connects the electronic pet apparatus 1 to a predetermined network 18 through a telephone line. The electronic pet apparatus 1 exchanges various kinds of data DT with the network 18 and, when necessary, updates information such as contents of the random-access memory 16 by using the exchanged data. To put it in detail, the electronic pet apparatus 1 is thus capable of acquiring various kinds of data required for training and nurturing the electronic pet from the network 18 when necessary. In addition, data stored in the random-access memory 16 may be transmitted to a desired terminal by way of the network 18. As a result, the electronic pet can be treated as if the pet were taken out to a variety of environments by exporting data to terminals connected to the network 18. On the contrary, an electronic pet of another apparatus connected to the network 18 can be trained by using this electronic pet apparatus.

The IC card 8 is an external recording device that can be mounted and dismounted. If necessary, data stored in the IC card is used for updating information such as the contents of the random-access memory 16, or data stored in the random-access memory 16 can be transferred to the IC card 8. In this way, the electronic pet apparatus 1 is capable of exchanging various kinds of data with other equipment through the IC card 8, making it possible to acquire various kinds of data necessary for training and nurturing of the electronic pet. In addition, the electronic pet can be treated as if the pet were taken out to a variety of environments and, on the contrary, an electronic pet of another apparatus can be trained by using this electronic pet apparatus 1.

FIG. 1 is a block diagram showing a basic configuration of the electronic pet apparatus 1 in terms of functional blocks. It should be noted that rectangular functional blocks shown in FIG. 1 each represent a processing program stored in the read-only memory 15 to be executed by the central processing unit 11. On the other hand, a functional block drawn as a symbol of a magnetic disc represents data stored in the read-only memory 15, the random-access memory 16 or the IC card 8.

A voice recognition module 11A employed in the electronic pet apparatus 1 carries out voice recognition processing on the audio data DA in conformity with a predetermined recognition rule, generating a result of voice recognition as an output. To put it in detail, the voice recognition module 11A delimits the voice represented by the sequentially received audio data DA by phonemes in accordance with the HMM (Hidden Markov Model) method. The voice recognition module 11A references the recognition data 16A for a series of such phonemes. On the basis of the results of this reference, the voice recognition module 11A outputs, as a result of recognition, text data representing a word or phrase cataloged in advance that corresponds to the audio data DA. The recognition data 16A is a data base associating text data of words and phrases with series of phonemes output by the HMM method. That is to say, the recognition data 16A is a data base used for storing pairs each comprising text data and a phoneme series. Such a data base allows the electronic pet apparatus 1 to convert a voice of “A Good kid” said by the user in front of the microphone 7 into an array of characters representing a text of “A Good kid.” As a result, a voice input is converted into an array of characters.
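
As a rough illustration only, the phoneme-to-text lookup just described might be organized as a simple table; the entries and the function name below are hypothetical assumptions, and only the idea of pairing a phoneme series with text data comes from the embodiment.

```python
# Minimal sketch of the recognition data 16A lookup: a table pairing a
# phoneme series (as output by the HMM stage) with text data. Entries
# and names are illustrative assumptions, not the actual data base.
RECOGNITION_DATA_16A = {
    "sil g u d o sil": "Good",        # hypothetical entry
    "sil m i k a N sil": "mikan",     # entry discussed later in the text
}

def lookup_text(phoneme_series: str) -> str | None:
    """Return the cataloged text for a phoneme series, if any."""
    return RECOGNITION_DATA_16A.get(phoneme_series)
```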

A timer 11B invokes components such as a physical-condition changing module 11C and an emotion changing module 11D at predetermined intervals.

When activated by the timer 11B, the physical-condition changing module 11C updates physical-condition data 16B in accordance with a result of voice recognition. The physical-condition data 16B includes parameters representing the present physical condition of the electronic pet. It should be noted that, in the case of this embodiment, the physical-condition data 16B comprises 5 parameters called “fatigue”, “hunger”, “thirstiness”, “sickness” and “sleepiness” respectively, as shown in FIG. 4. The larger the value of a parameter, the greater the share of the parameter in the physical condition of the electronic pet. The typical values shown in FIG. 4 thus indicate that, at the present time, the electronic pet is extremely tired and very hungry.

As described above, the physical-condition changing module 11C updates the physical-condition data 16B in accordance with a result of voice recognition and as activated by the timer 11B. For example, the “hunger”, “thirstiness” and “sleepiness” parameters are increased gradually in conformity with the rule of nature, as is generally seen in the course of typical nurturing of a real pet. As a result, the electronic pet gets hungry with the lapse of time. Another example of an operation to update the physical-condition data 16B in accordance with a result of voice recognition is an operation to decrease the “hunger” parameter when a result of voice recognition indicates that food has been given to the electronic pet. Still another example is an operation to decrease the “thirstiness” parameter when a result of voice recognition indicates that a drink has been given to the electronic pet. A further example is an operation to gradually increase the “fatigue” parameter when a result of voice recognition indicates that the owner is playing with the electronic pet. A still further example is an operation to gradually decrease the “sleepiness” parameter synchronously with the timer when a result of voice recognition indicates that the owner tells the electronic pet to sleep.
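
The sketch below illustrates how such timer-driven and recognition-driven updates of the physical-condition data 16B might look; the parameter names follow FIG. 4, while the numeric increments and the trigger words are invented for illustration.

```python
# Physical-condition data 16B as in FIG. 4; all numeric steps and the
# trigger words below are assumptions, not values from the embodiment.
physical_condition_16b = {
    "fatigue": 0, "hunger": 0, "thirstiness": 0,
    "sickness": 0, "sleepiness": 0,
}

def on_timer_tick(cond: dict) -> None:
    """Timer-driven drift: the pet slowly gets hungry, thirsty, sleepy."""
    for key in ("hunger", "thirstiness", "sleepiness"):
        cond[key] += 1

def on_recognition_result(cond: dict, text: str) -> None:
    """Recognition-driven updates, e.g. giving food lowers 'hunger'."""
    if text == "food":
        cond["hunger"] = max(0, cond["hunger"] - 10)
    elif text == "drink":
        cond["thirstiness"] = max(0, cond["thirstiness"] - 10)
    elif text == "play":
        cond["fatigue"] += 2
```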

On the other hand, the emotion changing module 11D updates the present emotion data 16C in accordance with a result of voice recognition as activated by the timer 11B. The present emotion data 16C includes variables representing emotions of the current electronic pet in a pseudo manner. Such variables are each also referred to as a pseudo emotion parameter. It should be noted that, in the case of this embodiment, there are 6 pseudo emotion parameters which represent the “anger”, “sadness”, “joy”, “fear”, “surprise” and “hatred” emotions respectively, as shown in FIG. 5. The larger the value of a pseudo emotion parameter, the greater the emotion represented by the parameter. The typical set of values of the pseudo emotion parameters shown in FIG. 5 indicates that, at the present time, the electronic pet is joyful but angry.

As described above, the emotion changing module 11D updates the emotion data 16C in conformity with the rule of nature, as is generally seen in the course of typical nurturing of a real pet. That is to say, when the emotion changing module 11D updates the emotion data 16C as activated by the timer 11B, the 6 variables, namely, the “anger”, “sadness”, “joy”, “fear”, “surprise” and “hatred” emotion parameters, are each gradually updated so as to approach a predetermined reference value. Thus, in the case of the example shown in FIG. 5, the “anger”, the “sadness” and the other emotions are gradually settled.
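
A sketch of this settling behavior follows; the reference value and the step size are assumptions, and only the gradual approach of each parameter toward a reference comes from the description above.

```python
REFERENCE_VALUE = 50   # assumed neutral level for every emotion
STEP = 1               # assumed settling speed per timer tick

def settle_emotions(emotion_data_16c: dict) -> None:
    """Move each emotion parameter one step toward the reference value."""
    for name, value in emotion_data_16c.items():
        if value > REFERENCE_VALUE:
            emotion_data_16c[name] = value - STEP
        elif value < REFERENCE_VALUE:
            emotion_data_16c[name] = value + STEP
```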

When the emotion changing module 11D updates the emotion data 16C in accordance with a result of voice recognition, on the other hand, the character data 16D is searched for information indicated by the result of voice recognition and the information is then used as a basis for updating the emotion data 16C.

As shown in FIG. 6, the character data 16D comprises changes in the emotion data 16C classified by phrase (or word) included in a result of voice recognition. That is to say, by using a phrase included in a result of voice recognition as a keyword, the character data 16D can be searched for the variations in the emotion data 16C proper for the phrase (or the word). Assume, for example, that the user says “Good” to the electronic pet. In this case, the “anger”, “sadness”, “joy”, “fear”, “surprise” and “hatred” emotion parameters are changed by −1, +2, +20, −5, +5 and −1 respectively, as shown in FIG. 6. In other words, variations in the emotion data 16C of −1, +2, +20, −5, +5 and −1 are assigned to the word “Good”.
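
As a sketch, the character data 16D can be pictured as a keyword-to-variations table; the entry for “Good” follows the figures quoted above, while the container names and the clamping range are assumptions.

```python
# Character data 16D: per-keyword variations in the six emotion
# parameters, in the order anger, sadness, joy, fear, surprise, hatred.
CHARACTER_DATA_16D = {
    "Good": (-1, +2, +20, -5, +5, -1),   # values quoted in the text
}

EMOTION_NAMES = ("anger", "sadness", "joy", "fear", "surprise", "hatred")

def apply_keyword(emotion_data_16c: dict, keyword: str) -> None:
    """Add the variations assigned to a keyword to emotion data 16C."""
    variations = CHARACTER_DATA_16D.get(keyword)
    if variations is None:
        return
    for name, delta in zip(EMOTION_NAMES, variations):
        # Clamp to an assumed 0..100 range.
        emotion_data_16c[name] = max(0, min(100, emotion_data_16c[name] + delta))
```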

Thus, when the user says “A Good kid,” for example, the emotion changing module 11D updates the emotion data 16C shown in FIG. 5 to that shown in FIG. 7. In this way, the emotion changing module 11D serves as an emotion generation means which generates pseudo emotion parameters each representing an emotion in a pseudo manner and updates the emotion data on the basis of a predetermined emotion-parameter generation rule at least in accordance with a result of voice recognition. In addition, the pseudo emotion parameters also vary with the lapse of time.

A response-sentence creation module 11E generates a response to a result of voice recognition in accordance with predetermined response generation rules based on the physical-condition data 16B and the emotion data 16C. Pattern data 16E is a set of rules for generation of such a response. As shown in FIG. 8, each of the rules describes a response to an input key phrase which includes a word obtained as a result of voice recognition. Determined by a key phrase, a response described by a rule also varies in accordance with the emotion data 16C and the physical-condition data 16B. It should be noted that only the minimum required rules are shown in FIG. 8 in order to make the explanation simple. Actual rules prescribe conditions (including attributes to be described later) other than the conditions shown in FIG. 8. Rule 2 shown in FIG. 8 is an example of a rule based on the emotion data 16C only. It should be noted that a rule can be based on a combination of the emotion data 16C and the physical-condition data 16B.

Rule 1 shown in FIG. 8 prescribes response phrases to an input phrase “I love you” or “I like you.” According to Rule 1, if the input phrase is a voice of an authenticated user, a response phrase saying “I love you, too” or “Wow, I am a male though” is output at random. If the input phrase is not a voice of an authenticated user, on the other hand, a response phrase saying “A strange person” or “Who are you?” is output at random.

Rule 2 shown in FIG. 8 prescribes response phrases to an input phrase “Good day” or “Hello.” As described above, the response phrases are based on the “anger”, “sadness”, “joy”, “fear”, “surprise” and “hatred” emotions of the emotion data. To be more specific, the response phrase “Shut up”, “What?”, “Howdy”, “I am surprised”, “Hi” or “Did you call me?” is selected as an output when the corresponding “anger”, “sadness”, “joy”, “fear”, “surprise” or “hatred” emotion parameter is the largest and exceeds a predetermined value.

The statement ‘authenticated (A); (B)’ in Rule 1 shown in FIG. 8 means that the phrase (A) is selected if a result of user authentication or the like, to be described later, is set at a Boolean value of “TRUE”, and that the phrase (B) is selected if the result of the user authentication or the like is not set at “TRUE”. The statement ‘random (“A”, “B”)’ means that either the phrase “A” or “B” is selected at random.
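
A sketch of how these two constructs might be evaluated follows; the function names are hypothetical, and only the selection semantics come from the rule notation just described.

```python
import random

def authenticated(is_true: bool, phrase_a: str, phrase_b: str) -> str:
    """'authenticated (A); (B)': A when the result is TRUE, B otherwise."""
    return phrase_a if is_true else phrase_b

def random_pick(*phrases: str) -> str:
    """'random ("A", "B")': pick one of the phrases at random."""
    return random.choice(phrases)

# Rule 1 applied to "I love you" from an authenticated user:
response = authenticated(
    True,
    random_pick("I love you, too", "Wow, I am a male though"),
    random_pick("A strange person", "Who are you?"),
)
```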

By the way, the “joy” emotion parameter in the typical emotion data 16C shown in FIG. 7 has the largest value among the variables. Thus, according to Rule 2, the word “Howdy” for the joy emotion is selected.

While the response-sentence creation module 11E creates a response based on the emotion data 16C as described above, depending on the input key phrase, the response-sentence creation module 11E also creates a response based on the physical-condition data 16B or on a combination of the emotion data 16C and the physical-condition data 16B, as mentioned earlier. With such a response-sentence creation module 11E, when the electronic pet is in an unsatisfactory physical condition, the electronic pet apparatus 1 thus generates a response corresponding to that condition.

The response-sentence creation module 11E records a generated response to such a result of voice recognition in a conversation history 16F. If necessary, the response-sentence creation module 11E generates a response by referring to the conversation history 16F. In this way, an unnatural conversation between the electronic pet and the user can be avoided. In addition, the response-sentence creation module 11E also generates a response by referring to a knowledge base 16G. As a result, the electronic pet apparatus 1 is capable of changing the response in dependence on the user, who is identified typically by carrying out processing to authenticate the user.

A voice synthesis module 11F searches voice data 16H for voice data DB corresponding to a response output by the response-sentence creation module 11E, outputting the voice data DB as a search result. As shown in FIG. 9, the voice data 16H is a collection of voice files each corresponding to a response. For example, when the response “Howdy” is output, a voice file named ‘voice0005.wav’ is selected and the voice data DB recorded in this voice file is output.

A picture synthesis module 11G searches picture data 16I for picture data DV corresponding to a response output by the response-sentence creation module 11E, outputting the picture data DV as a search result. As shown in FIG. 10, the picture data 16I is a collection of picture-data files each corresponding to a response. For example, when the response “Howdy” is output, a picture-data file named ‘fig0005.bmp’ is selected and the picture data DV recorded in this picture-data file is output.
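
Taken together, the two searches amount to a pair of response-to-file lookups, sketched below with the file names from the examples of FIGS. 9 and 10; the surrounding names are assumptions.

```python
# Response-to-file lookups following the examples of FIGS. 9 and 10.
VOICE_DATA_16H = {"Howdy": "voice0005.wav"}
PICTURE_DATA_16I = {"Howdy": "fig0005.bmp"}

def media_for_response(response: str) -> tuple[str | None, str | None]:
    """Return the voice file and the picture file matching a response."""
    return VOICE_DATA_16H.get(response), PICTURE_DATA_16I.get(response)

print(media_for_response("Howdy"))  # ('voice0005.wav', 'fig0005.bmp')
```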

1-2. Connection to the Network

The central processing unit 11 executes a processing procedure shown in FIG. 11 to connect the electronic pet apparatus 1 to the network 18 through the network connection unit 17. Connected to the network 18, the electronic pet apparatus 1 is capable of transmitting the physical-condition data 16B, the emotion data 16C and the conversation history 16F to desired equipment by way of the network 18. The equipment receiving such data from the electronic pet apparatus 1 is capable of reproducing the electronic pet of the electronic pet apparatus 1. In this way, the electronic pet can be taken out to a variety of environments.

On the contrary, the central processing unit 11 is capable of acquiring physical-condition data 16B, emotion data 16C and a conversation history 16F from the network 18, allowing an electronic pet raised in another electronic pet apparatus to be brought out to this electronic pet apparatus 1. In this case, the functional blocks of the electronic pet apparatus 1 carry out processing based on the native physical-condition data 16B, emotion data 16C and conversation history 16F to emulate the electronic pet raised by this apparatus itself. In parallel, they carry out processing based on the physical-condition data 16B, emotion data 16C and conversation history 16F imported from the other electronic pet apparatus through the network 18 to emulate the electronic pet raised in that apparatus, producing responses as if the other electronic pet were visiting this electronic pet apparatus 1. It should be noted that, in the functional blocks shown in FIG. 1, flows of data in the processing based on the physical-condition data 16B, the emotion data 16C and the conversation history 16F acquired from the other electronic pet apparatus through the network 18 are not shown.

The electronic pet apparatus 1 is also capable of acquiring recognition data 16A, pattern data 16E, a knowledge base 16G, voice data 16H and picture data 16I from the network 18 to increase the size of the vocabulary of spoken words that can be recognized by the so-called electronic pet and to increase the number of response types. As a result, the electronic pet apparatus 1 is capable of raising and teaching the electronic pet.

As shown in FIG. 11, the procedure begins with a step SP1. In response to a request for connection, the flow of the procedure goes on to a step SP2 at which the central processing unit 11 accepts the request. It should be noted that such requests for connection are generated periodically by the timer 11B at fixed intervals. In addition, a request for connection can be made by the user by operating an operator. Furthermore, a connection can also be established in response to an incoming call from the network 18.

The flow of the procedure then goes on to a step SP3 at which the central processing unit 11 establishes a communication by carrying out predetermined line connection processing. Then, the flow of the procedure proceeds to a step SP4 at which the central processing unit 11 exchanges various kinds of data, depending on the substance of the request for connection, with a communication partner. Subsequently, the flow of the procedure proceeds to a step SP5 at which the central processing unit 11 cuts off the communication. Finally, the flow of the procedure proceeds to a step SP6 at which the central processing unit 11 ends the processing procedure.

FIG. 12 is a diagram showing the format of transferred data. The electronic pet apparatus 1 exchanges data with a communication partner by way of an interface included in the network connection unit 17 and an interface in the communication partner in accordance with this format. As shown in the figure, each piece of data DT has a header describing information such as the address and the type of the data DT. Typically, the data DT includes pattern data 16E, recognition data 16A, voice data 16H, picture data 16I and so on, which are arranged sequentially as necessary.
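
The layout below is a sketch of such a packet; the description only specifies a header carrying the address and the type of the data, so the concrete field names and types are assumptions.

```python
from dataclasses import dataclass

@dataclass
class DataDT:
    """One piece of transferred data DT: header fields plus payload."""
    address: str      # where the data is to be delivered (header field)
    data_type: str    # e.g. 'pattern', 'recognition', 'voice', 'picture'
    payload: bytes    # the data items themselves, arranged sequentially

packet = DataDT(address="192.0.2.1", data_type="recognition",
                payload=b"...")
```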

1-3. Cataloging Recognition Data

FIG. 13 is a functional block diagram showing the electronic pet apparatus 1 in more detail in an operation to catalog recognition data 16A. In this functional block diagram, a cataloging module 11I catalogs a result of voice recognition as recognition data 16A. In this way, it is possible to teach the electronic pet a variety of words orally without entering the words via an input unit such as a keyboard.

In order to accomplish the purpose described above, the voice recognition module 11A processes the voice data DA by adoption of the HMM method, outputting a series of phonemes as a result of voice recognition. To put it in detail, a voice expressed in the Japanese language is analyzed to identify its phonemes, which are each indicated by an identifier. Thus, a pronunciation in the Japanese language can be expressed by an array of identifiers. The identifiers are listed as follows: ‘b’, ‘d’, ‘g’, ‘p’, ‘t’, ‘k’, ‘m’, ‘n’, ‘r’, ‘z’, ‘ch’, ‘ts’, ‘y’, ‘w’, ‘h’, ‘i’, ‘e’, ‘a’, ‘o’, ‘u’, ‘N’, ‘ei’, ‘ou’, ‘s’, ‘sh’, ‘xy’, ‘j’, ‘f’ and ‘sil’. The phoneme ‘sil’ is soundless.

When the user says “mikan” (“oranges”, in English) as an input, for example, the voice recognition module 11A recognizes the voice input as a series of phonemes expressed by the identifiers ‘sil m i k a N sil’. The voice recognition module 11A sequentially processes the voice data DA supplied thereto to identify its phonemes. The results of recognition are then processed according to the syntax shown in FIG. 14 to detect a series of phonemes represented by a series of identifiers. It should be noted that the syntax shown in FIG. 14 indicates the permitted connections of all the phonemes listed above.
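
One way such a connection syntax might be enforced is sketched below; the transition table is invented for illustration and is not the actual syntax of FIG. 14, which is not reproduced here.

```python
# Hypothetical table of permitted phoneme-to-phoneme connections.
ALLOWED_NEXT = {
    "sil": {"m", "k", "t", "sil"},
    "m": {"i", "a", "e", "o", "u"},
    "i": {"k", "N", "sil"},
    "k": {"a", "i", "u"},
    "a": {"N", "sil"},
    "N": {"sil"},
}

def is_valid_sequence(phonemes: list[str]) -> bool:
    """Check that every adjacent pair of phonemes is a permitted connection."""
    return all(b in ALLOWED_NEXT.get(a, set())
               for a, b in zip(phonemes, phonemes[1:]))

print(is_valid_sequence("sil m i k a N sil".split()))  # True
```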

In a normal operating mode, the voice recognition module 11A searches the recognition data 16A for text data including a word or a phrase corresponding to an array of identifiers detected in this way, outputting the text data as a result of recognition. Thus, when a word not cataloged in the recognition data 16A is received from the user as a voice input in this embodiment, it is difficult to generate text data and it is hence hard to give a correct response to the voice input given by the user.

In order to solve this problem, the electronic pet apparatus 1 implemented by this embodiment is connected to the network 18 by the network connection unit 17 and is capable of downloading recognition data 16A from the network 18. In this way, the downloaded recognition data 16A is taught to the electronic pet so that the electronic pet is capable of giving responses to a variety of sayings.

In addition, in this embodiment, the central processing unit 11 executes a processing procedure shown in FIG. 15 when a catalog mode is selected by the user. During the execution of the processing procedure, the user is requested to operate the confirm operator 4A and the cancel operator 4B as described below. The procedure is executed to catalog a word said by the user into the recognition data 16A.

As shown in FIG. 15, the procedure begins with a step SP11. When a predetermined operator is operated, the flow of the procedure goes on to a step SP12 to enter a catalog mode, in which the central processing unit 11 executes the picture synthesis module 11G to display a predetermined message on the liquid-crystal display panel 3. The message requests the user to pronounce a word. The flow then proceeds to a step SP13 at which the central processing unit 11 accepts a voice input from the user.

Then, the flow of the procedure proceeds to a step SP14 at which the central processing unit 11 carries out voice recognition on the voice data DA received sequentially, identifying the data DA sequentially as a series of phonemes. As the user operates a predetermined operator to end the voice input, the flow of the procedure goes on to a step SP15.

At the step SP15, the central processing unit 11 executes the voice synthesis module 11F in accordance with the series of phonemes obtained as a result of voice recognition to reproduce the voice received from the user. In this way, the result of voice recognition can be presented to the user. Assume that the user says the word “mikan”. In this case, the central processing unit 11 produces a phoneme array of ‘sil m i k a N sil’ as a result of voice recognition and the voice synthesis module 11F generates a sound saying: “Is it a mikan?” The flow of the procedure then goes on to a step SP16 at which the central processing unit 11 accepts a signal entered by the user by operating the confirm operator 4A or the cancel operator 4B in response to the generated query sound.

The flow of the procedure then goes on to a step SP17 at which the central processing unit 11 forms a judgment as to whether the confirm operator 4A or the cancel operator 4B has been operated by the user. If the cancel operator 4B has been operated by the user, the central processing unit 11 determines that the result of voice recognition presented to the user has been denied. In this case, the flow of the procedure goes back to the step SP13 to again accept a voice input. If the confirm operator 4A has been operated by the user, on the other hand, the central processing unit 11 determines that the result of voice recognition presented to the user has been accepted. In this case, the flow of the procedure goes on to a step SP18.

At the step SP18, the central processing unit 11 again executes the picture synthesis module 11G to display a predetermined message on the liquid-crystal display panel 3. The message requests the user to say an attribute for the word said earlier as a voice input. An attribute is a keyword showing the property of an object identified by a word. An attribute is used for classifying an object. In the case of the word “mikan”, for example, the attribute “fruit” is said by the user to determine the category of the word “mikan”.

The flow of the procedure then goes on to a step SP19 at which the central processing unit 11 carries out voice recognition on the voice data DA received sequentially, identifying the data DA sequentially as a series of phonemes. As the user operates a predetermined operator to end the voice input, the flow of the procedure goes on to a step SP20.

At the step SP20, the central processing unit 11 executes the voice synthesis module 11F in accordance with the series of phonemes obtained as a result of voice recognition to reproduce the voice received from the user. In this way, the result of voice recognition carried out on the attribute can be presented to the user. Assume that the user says the word “fruit” as an attribute after saying the word “mikan”. In this case, the voice synthesis module 11F generates a sound saying: “Is it a fruit?” The flow of the procedure then goes on to a step SP21 at which the central processing unit 11 accepts a signal entered by the user by operating the confirm operator 4A or the cancel operator 4B in response to the generated query sound.

The flow of the procedure then goes on to a step SP22 at which the central processing unit 11 forms a judgment as to whether the confirm operator 4A or the cancel operator 4B has been operated by the user. If the cancel operator 4B has been operated, the central processing unit 11 determines that the result of voice recognition presented to the user has been denied. In this case, the flow of the procedure goes back to the step SP18 to again accept a voice input. If the confirm operator 4A has been operated, on the other hand, the central processing unit 11 determines that the result of voice recognition presented to the user has been accepted. In this case, the flow of the procedure goes on to a step SP23.

At the step SP23, the central processing unit 11 catalogs the word ‘mikan’ into the recognition data 16A and the attribute ‘fruit’ into the knowledge base 16G. The flow of the procedure then proceeds to a step SP24 to end the whole processing.
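
The catalog procedure of FIG. 15 amounts to two confirm-or-retry loops followed by a catalog step. Below is a minimal sketch of that flow; the helper functions and the console-based confirmation are assumptions standing in for the microphone, the voice synthesis module 11F and the operators 4A and 4B.

    # A sketch of the FIG. 15 catalog flow (steps SP11-SP24); the helpers
    # are stand-ins, not the apparatus's actual interfaces.

    def confirm(prompt):
        """Stand-in for the confirm operator 4A / cancel operator 4B."""
        return input(prompt + " [y/n] ").strip().lower() == "y"

    def accept_phonemes(prompt):
        """Stand-in for a voice input converted to a series of phonemes."""
        return input(prompt + " ")

    def catalog_word(recognition_data, knowledge_base):
        # SP13-SP17: accept a word as phonemes and let the user confirm it.
        while True:
            word = accept_phonemes("Pronounce a word (as phonemes):")
            if confirm("Is it '%s'?" % word):
                break                      # confirmed: proceed to the attribute
        # SP18-SP22: accept and confirm the attribute the same way.
        while True:
            attribute = accept_phonemes("Say an attribute for the word:")
            if confirm("Is it '%s'?" % attribute):
                break
        # SP23: catalog the word and associate the attribute with it.
        recognition_data.append(word)
        knowledge_base[word] = attribute

    recognition_data, knowledge_base = [], {}
    # catalog_word(recognition_data, knowledge_base)  # e.g. 'sil m i k a N sil' / 'fruit'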

The knowledge base 16G records attributes, such as “fruit” and “drink”, showing the classification of words and phrases cataloged in the recognition data 16A. The pattern data 16E also records attributes, which make the central processing unit 11 capable of asking the user, for example, the question: “What food do you like?” In response to this question, let the user answer: “I like mikan (oranges).” Then, in response to the answer given by the user, the central processing unit 11 for example makes a comment: “I don't like mikan (oranges).”

In addition to attributes, the knowledge base 16G also includes the name and favorites of the keeper, that is, the owner of the electronic pet apparatus 1, as well as various kinds of data such as a weather forecast received from the network 18. If necessary, this data can be utilized in a conversation with the user. When the user asks the question: “What is today's weather forecast?”, for example, the electronic pet apparatus 1 is capable of giving the answer: “Clear weather” in conformity with a predetermined rule using the words ‘today’ and ‘weather’ as key phrases.
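
A minimal sketch of how such knowledge base entries might be consulted is shown below; the table layout, the entries and the key-phrase rule are illustrative assumptions rather than the format used by the knowledge base 16G.

    # Fire the first rule whose key phrases all occur in the user's input.
    knowledge_base = {
        ("today", "weather"): "Clear weather",   # e.g. a downloaded forecast
        ("master", "name"): "Taro",              # keeper data recorded earlier
    }

    def answer(user_input):
        text = user_input.lower()
        for key_phrases, response in knowledge_base.items():
            if all(phrase in text for phrase in key_phrases):
                return response
        return None

    print(answer("What is today's weather forecast?"))   # -> Clear weather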

In an operation to catalog a voice input into the recognition data 16A in the electronic pet apparatus 1 as described above, no correct text for the voice input needs to exist in the recognition data. In the above example, the correct text would be a text describing the word “mikan.” Instead, the text data obtained as a result of voice recognition is an array of alphabetical marks, that is, an array of identifiers representing the series of phonemes of the word or phrase entered by the user as a voice input. In the above example, the array of alphabetical marks is ‘sil m i k a N sil’ describing the word to be cataloged into the recognition data 16A. If necessary, a text downloaded from the network 18 can also be cataloged into the recognition data 16A. With such a text cataloged in the recognition data 16A, a response may be generated from the recorded text in place of the identifiers corresponding to a series of phonemes obtained as a result of voice recognition.

In the electronic pet apparatus 1, recognition data 16A of a word or a phrase cataloged as a result of recognition of a voice input is processed in the same way as recognition data 16A of a word or a phrase downloaded from the network 18 and recognition data 16A of a word or a phrase cataloged in advance, allowing a conversation to be held with the user.

1-4. User Authentication

FIG. 16 is a functional block diagram showing the electronic pet apparatus 1 in an operation to authenticate the user in more detail. In this functional block diagram, authentication data 16K includes a user name recorded in advance. It should be noted that the user name is recorded as a result of voice recognition. Instead of obtaining the user name as a result of voice recognition, the user name can be entered via the keyboard of an external apparatus in initial setting processing which is typically carried out when the electronic pet apparatus 1 is purchased.

The response-sentence creation module 11E returns, for example, an answer saying: “Are you really the master?” in response to a key phrase saying: “Gao” in accordance with Rule 1 of the pattern data 16E shown in FIG. 17.

In accordance with Rule 2, the voice authentication module 11J sets a Boolean value ‘authenticated’ at “TRUE” (described as ‘set_authenticated(TRUE)’ in Rule 2) if the following 2 conditions are satisfied:

a key phrase ‘$USER’ defined as a user name and cataloged in advance is entered as a voice input; and

a response including a phrase saying: “Are you really the master?” is generated by the response-sentence creation module 11E as myLastUtter immediately before the voice input ‘$USER’.

It should be noted that the function set_authenticated(TRUE) cited above sets the Boolean value ‘authenticated’ at TRUE.

To put it in detail, the voice authentication module 11J searches the authentication data 16K for a user name matching a result of recognition of the voice input. If such a name is found in the search, the person entering the voice input is authenticated as the user and an authenticated state 16J is set at an authenticated user state. If such a name is not found in the search, on the other hand, the person entering the voice input is not authenticated as the user and the authenticated state 16J is set at an unauthenticated user state.

If the user is authenticated, the response-sentence creation module 11E generates a response saying: “Yes, you are the master” in accordance with Rule 2.
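
A minimal sketch of the Rule 2 check follows, assuming plain-string stand-ins for the cataloged key phrase ‘$USER’, the preceding utterance myLastUtter and the authenticated state 16J.

    # Both Rule 2 conditions must hold: the input matches the cataloged user
    # name, and the immediately preceding response was the master query.
    authentication_data = {"user_name": "taro"}   # recorded in advance (16K)
    state = {"authenticated": False}              # authenticated state 16J

    def set_authenticated(value):
        state["authenticated"] = value

    def on_voice_input(recognized_text, my_last_utter):
        if (recognized_text == authentication_data["user_name"]
                and "Are you really the master?" in my_last_utter):
            set_authenticated(True)
            return "Yes, you are the master"
        return None

    print(on_voice_input("taro", "Are you really the master?"))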

As described above, the electronic pet apparatus 1 is capable of authenticating a user on the basis of a voice input and giving the owner a response different from a response given to a user other than the owner, as shown in FIG. 8. The electronic pet thus displays a behavior special for the owner, as an actual pet does.

Also as described above, a user is authenticated by comparing a voice input with a word cataloged in advance.

In addition to the name of the user who has been authenticated, the voice authentication module 11J may also request the response-sentence creation module 11E to output a response making an inquiry about a favorite or hobby of the user, to be recorded in the knowledge base 16G, in a conversation with the user who has been authenticated as the owner, as shown in FIG. 18. In the example shown in the figure, the inquiry says: “What is your favorite food, master?” This question asks the favorite food of the speaker who has been authenticated as the owner.

In response to this query, the user says: “Peanuts” as shown in FIG. 18. The word “peanuts” is subjected to a voice recognition process in the voice authentication module 11J and processed in the same way as Rule 2 shown in FIG. 17 in order to judge the user's voice input as a response to the inquiry about a favorite. The word “peanuts” is then cataloged in the authentication data 16K.

During a conversation with a person entering a voice input, the response-sentence creation module 11E, when invoked by the timer 11B, generates an inquiry about a favorite, a hobby or the like cataloged in advance in the authentication data 16K, such as the one shown in FIG. 19. In the case of the favorite food cataloged in the authentication data 16K as shown in FIG. 18, for example, the response-sentence creation module 11E generates the inquiry: “Are you really the master? What is your favorite food?” as shown in FIG. 19.

The voice authentication module 11J determines whether or not a voice input given by the user in response to the inquiry about the favorite food is correct by carrying out the same processing as the one according to Rule 2 explained earlier by referring to FIG. 17. Since the user is the owner in this case, a voice-input response of “Peanuts” is obtained. From a result of voice recognition of this response, the authentication state is set at a Boolean value of “TRUE”. In addition, the response-sentence creation module 11E generates a response of “You are really my master!”

In this way, the electronic pet apparatus 1 is capable of forming a judgment on a result of voice recognition based on a result of voice recognition obtained in the past. To put it in detail, the electronic pet apparatus 1 is capable of making an inquiry about a result of voice recognition obtained in the past in response to the user's input during a conversation with the user, and of forming a judgment on a result of voice recognition of another voice input given in response to the inquiry in order to authenticate the user.

In addition, when the user does not give a voice input in response to an inquiry made by the response-sentence creation module 11E as triggered by the timer 11B even after a predetermined period of time has elapsed, the voice authentication module 11J assumes that the user has terminated operations of the electronic pet apparatus 1, resetting the authentication state.

1-5. Processing to Classify Conversations

FIG. 20 is a functional block diagram showing the electronic pet apparatus 1 in processing to classify conversations in more detail. In this functional block diagram, a word/phrase classification module 11M identifies a result of voice recognition to classify conversations entered as a voice input in conformity with a predetermined classification rule 16M, outputting a classification code to the response-sentence creation module 11E as a result of classification.

For example, the word/phrase classification module 11M classifies voice inputs of general greetings such as “Good morning” and “Good day” into a “greeting” category. Voice inputs of inquiries such as “How are you?” and “What do you like?” are classified into an “inquiry” category. Voice inputs of impressions such as “I am fine” and “Bored” are classified into an “impression” category.
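
A minimal sketch of such classification follows, assuming the classification rule 16M can be reduced to a phrase-to-category table; the actual rule operates on voice recognition results rather than plain strings.

    # Return the classification code for an utterance, if one is known.
    classification_rule = {
        "good morning": "greeting",
        "good day": "greeting",
        "how are you?": "inquiry",
        "what do you like?": "inquiry",
        "i am fine": "impression",
        "bored": "impression",
    }

    def classify(utterance):
        return classification_rule.get(utterance.lower(), "unknown")

    print(classify("Good day"))   # -> greeting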

In an operation to create a response sentence according to the pattern data 16E, the response-sentence creation module 11E forms a response according to response-sentence categories recorded in the pattern data 16E and the category classified by the word/phrase classification module 11M. In addition, a response is created also in accordance with past conversation records stored in the conversation history 16F.

The pattern data 16E includes rules to be followed to classify response sentences as shown in FIG. 21. The rules have the same syntax of comparison as the rules shown in FIG. 8. It should be noted that the classification rules shown in FIG. 21 are set for classification to be carried out by the word/phrase classification module 11M.

According to Rule 1 shown in FIG. 21, the phrases saying: “I love you, too” and “Wow, I am a male though” are classified into a “state” category, a phrase saying: “A strange person” is classified into the “impression” category, and a phrase saying: “Who are you?” is classified into the “query” category. According to Rule 2, a phrase saying: “Shut up” is classified into the “impression” category, a phrase saying: “What?” is classified into the “query” category, a phrase saying: “Howdy” is classified into the “greeting” category, a phrase saying: “I am surprised” is classified into the “impression” category, a phrase saying: “Hi” is classified into the “greeting” category, and a phrase saying: “Did you call me?” is classified into the “query” category.

In addition, the pattern data 16E also prescribes a sort of restriction that prohibits a conversation from comprising consecutive specific categories. To put it concretely, the restriction is set so that no inquiry shall be returned in response to an inquiry made by the user. Furthermore, after 2 consecutive “greetings” are exchanged between the electronic pet apparatus 1 and the user, the electronic pet apparatus 1 shall not again issue a “greeting”, as stipulated in a restriction shown at the bottom of FIG. 21.

A conversation history 16F shown in FIG. 22 or 23 is made by the response-sentence creation module 11E to record a conversation between the electronic pet and the user. As shown in the figures, a history includes the action-taking party generating a voice, the category of the voice and the contents of the voice. The history shown in FIG. 22 describes the user as a first action-taking party, the “greeting” category as the type of the voice generated by the first action-taking party and a phrase saying: “Good day” as the contents of that voice. The user is followed by the electronic pet as a second action-taking party. The type of the voice generated by the second action-taking party is also the “greeting” category and the contents of the voice are a phrase saying: “Hi”. The electronic pet is followed by the user as a third action-taking party. The type of the voice generated by the third action-taking party is the “query” category and the contents of the voice are a phrase saying: “How are you doing?” The user is followed by the electronic pet as a fourth action-taking party. The type of the voice generated by the fourth action-taking party is the “state” category and the contents of the voice are a phrase saying: “I am fine”.

When the response-sentence creation module 11E creates a response in accordance with the pattern data 16E and on the basis of the emotion data 16C, the conversation history 16F is used as a reference and the restrictions prescribed in the pattern data 16E are abided by. For example, after 2 consecutive greetings are exchanged between the electronic pet apparatus 1 and the user, the response-sentence creation module 11E shall not again issue a greeting by applying Rule 2 right after the 2 consecutive ones, as stipulated in the restriction shown in FIG. 21, even if the “joy” emotion parameter has the largest value among the emotion variables. In addition, no “inquiry” shall be returned in response to an “inquiry” made by the user.

By abiding by the restriction on greetings described above, even if a first rule stipulates that a greeting shall be returned in response to a greeting and a second rule stipulates that a greeting shall be returned in response to a variety of inquiries, it is possible to avoid an unnatural conversation comprising greetings exchanged between the user and the electronic pet repeatedly a number of times due to repetitive application of the first and second rules described above.
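
A minimal sketch of the restriction check follows, assuming a history of (party, category) pairs like the one in FIG. 22; the two restrictions mirror the ones described above.

    # Reject a candidate response whose category would violate a restriction.
    def violates_restriction(history, input_category, response_category):
        # No inquiry may be returned in response to an inquiry.
        if input_category == "inquiry" and response_category == "inquiry":
            return True
        # After 2 consecutive greetings, no further greeting is issued.
        recent = [category for _, category in history[-2:]]
        if response_category == "greeting" and recent == ["greeting", "greeting"]:
            return True
        return False

    history = [("user", "greeting"), ("pet", "greeting")]
    print(violates_restriction(history, "greeting", "greeting"))   # -> True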

1-6. Emotion Control

FIG. 24 is a functional block diagram showing the electronic pet apparatus 1 in execution of emotion control in more detail. In this functional block diagram, the emotion changing module 11D is activated by the timer 11B described earlier to search the character data 16D, by using a word included in a result of voice recognition as a keyword, for variations corresponding to the word, and updates the emotion data 16C by using the variations found in the search.

In this processing, the emotion changing module 11D records changes in the variables composing the emotion data 16C, text data obtained as a result of voice recognition of the user's input and the keywords each included in the text data and used for searching the character data 16D for the changes, as an emotion-change history 16N like the one shown in FIG. 25. In addition, with predetermined timing, typically after a response has been output, the emotion-change history 16N is searched for a word used frequently in user inputs in conjunction with a keyword. If such a word is found, the word is cataloged in the character data 16D as a new keyword as shown in FIG. 26. The character data 16D shown in FIG. 26 is obtained by cataloging a new keyword in the character data 16D shown in FIG. 6. By cataloging this word in the character data 16D as a new keyword, the variables of the emotion data 16C can be updated, even when this word alone is input, in the same way as for the other keywords.

For example, assume that the phrase “curry bread” is used in user inputs, as shown in FIG. 25, in conjunction with the keyword “dirty”, which changes the variables of the emotion data 16C, a number of times exceeding a predetermined value. In this case, the emotion changing module 11D catalogs the phrase “curry bread” in the character data 16D as a new keyword as shown in FIG. 26. As shown in FIG. 26, the variables of the emotion data 16C are then updated by using the same changes as for the keyword “dirty” even when the phrase “curry bread” alone is input.
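
A minimal sketch of this promotion step follows, assuming the emotion-change history 16N can be reduced to (keyword, co-occurring word) pairs; the variation values and the threshold are illustrative.

    # Promote a word that frequently co-occurs with an existing keyword to a
    # keyword of its own, inheriting the keyword's emotion variations.
    from collections import Counter

    character_data = {"dirty": {"anger": +1, "joy": -2, "hatred": +3}}
    emotion_history = [                       # (keyword, co-occurring word)
        ("dirty", "curry bread"), ("dirty", "curry bread"),
        ("dirty", "curry bread"),
    ]
    THRESHOLD = 2

    def promote_associations():
        counts = Counter(emotion_history)
        for (keyword, word), n in counts.items():
            if n > THRESHOLD and word not in character_data:
                character_data[word] = dict(character_data[keyword])

    promote_associations()
    print(character_data["curry bread"])   # same changes as "dirty"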

As a result, the electronic pet apparatus 1 sets a variety of parameters and variables so that a specific emotion results from so-called associative information, and is hence capable of generating a response based on the resulting emotion.

In addition, when the emotion changing module 11D searches the emotion-change history 16N with the predetermined timing as described above, the frequency of using each keyword for changing the variables of the emotion data 16C is also found. If the frequency of using a keyword is found greater than a predetermined value, as shown in FIG. 27, the emotion changing module 11D reduces the absolute values of the variations in the character data 16D for the keyword from the original values shown in FIG. 6. In the emotion-change history 16N shown in FIG. 27, for example, the keyword “dirty” is used frequently. Thus, the variations in the 6 variables, namely, the “anger”, “sadness”, “joy”, “fear”, “surprise” and “hatred” emotion parameters, in the character data 16D for the keyword “dirty” are changed to −1, −1, +2, +1, −1 and −4, respectively.
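
A minimal sketch of this damping step follows; the halving rule is an assumption, since the text only states that the absolute values of the variations are reduced.

    # Shrink the emotion variations of over-used keywords toward zero while
    # keeping the sign of each change.
    def accustom(character_data, usage_counts, threshold):
        for keyword, count in usage_counts.items():
            if count > threshold:
                for emotion, delta in character_data[keyword].items():
                    character_data[keyword][emotion] = int(delta / 2)

    character_data = {"dirty": {"anger": -2, "joy": +4, "hatred": -8}}
    accustom(character_data, {"dirty": 10}, threshold=5)
    print(character_data["dirty"])   # -> {'anger': -1, 'joy': 2, 'hatred': -4}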

In this way, the electronic pet apparatus 1 is capable of forming a so-called sense of accustoming.

If the frequency of using a keyword of the character data 16D in voice inputs gets lower, on the other hand, the emotion changing module 11D eliminates the keyword from the character data 16D. As described above, if the frequency of using a keyword is found greater than a predetermined value, the emotion changing module 11D reduces the absolute values of the variations in the character data 16D for the keyword. If the frequency of using the keyword decreases again, however, the variations are gradually restored to their original values.

In this way, the electronic pet apparatus 1 is capable of creating a so-called state of forgetting something.

1-7. Operation of the First Embodiment

In the configuration described above, the voice recognition module 11A employed in the electronic pet apparatus 1 shown in FIGS. 1 to 3 carries out a voice recognition process using the HMM method on a voice input entered by the user via the microphone 7. As described above, the voice recognition module 11A is a functional block, the processing of which is carried out by the central processing unit 11. In the voice recognition processing, a voice is first converted into a series of phonemes which are then transformed into text data by referring to the recognition data 16A.
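
A minimal sketch of the final lookup step follows, assuming the recognition data 16A can be reduced to a phoneme-series-to-text table; the HMM stage itself is not shown.

    # Convert a recognized phoneme series into text data via the recognition
    # data 16A; fall back to the raw phoneme series for words cataloged by
    # voice only (see the cataloging mode described above).
    recognition_data = {
        "sil m i k a N sil": "mikan",
        "sil g a o sil": "gao",
    }

    def phonemes_to_text(phoneme_series):
        return recognition_data.get(phoneme_series, phoneme_series)

    print(phonemes_to_text("sil m i k a N sil"))   # -> mikan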

In the electronic pet apparatus 1, text data obtained as a result of voice recognition carried out in this way is supplied to the physical-condition changing module 11C, which changes the 5 elements of the present physical condition, namely, the “fatigue”, “hunger”, “thirstiness”, “sickness” and “sleepiness” parameters of the physical-condition data 16B shown in FIG. 4, in accordance with a word included in the voice input. When food has been given as indicated by a result of voice recognition, for example, the “hunger” parameter is decreased and, when a drink has been received as indicated by a result of voice recognition, the “thirstiness” parameter is decreased.
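
A minimal sketch of this physical-condition update follows, assuming trigger words such as "food" and "drink"; the actual rule set is not spelled out beyond these examples.

    # Decrease parameters on trigger words; a timer tick changes them gradually.
    physical_condition = {"fatigue": 3, "hunger": 7, "thirstiness": 5,
                          "sickness": 0, "sleepiness": 2}

    def update_condition(recognized_words):
        if "food" in recognized_words:
            physical_condition["hunger"] = max(0, physical_condition["hunger"] - 1)
        if "drink" in recognized_words:
            physical_condition["thirstiness"] = max(0, physical_condition["thirstiness"] - 1)

    def tick():
        # Timer-driven gradual change: e.g. hunger grows with the lapse of time.
        physical_condition["hunger"] += 1

    update_condition(["food"])
    print(physical_condition["hunger"])   # -> 6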

In this way, the electronic pet apparatus 1 is capable of changing the physical condition by a voice input entered by the user. In addition, the 5 parameters can also be changed gradually by processing carried out by the physical-condition changing module 11C and based on the timer 11B. Thus, in the electronic pet apparatus 1, the physical condition expressed in terms of these parameters is modified by a voice input entered by the user and changes with the lapse of time. As a result, by generating a response based on the 5 parameters to a voice input, the physical condition of the electronic pet is reflected in the response to the voice input.

In addition, the result of voice recognition is supplied also to the emotion changing module 11D, which changes the emotion data 16C shown in FIG. 5 in accordance with a word included in the result of voice recognition. Changes in the emotion data 16C are described in the character data 16D, and the 6 variables expressing the emotion are updated in accordance with keywords and the character data 16D. To put it in detail, keywords are the words that change emotions of the electronic pet, while, as shown in FIG. 6, the character data 16D comprises variations in the 6 variables expressing the emotion, namely, the “anger”, “sadness”, “joy”, “fear”, “surprise” and “hatred” parameters, for a variety of keywords, that is, words included in voice inputs. That is to say, the emotion is changed in accordance with a voice input entered by the user.
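
A minimal sketch of this emotion update follows; the per-keyword variation values in the character data 16D are illustrative.

    # Each keyword found in the recognition result adds its per-emotion
    # variations from the character data 16D to the six emotion variables.
    emotion_data = {"anger": 0, "sadness": 0, "joy": 0,
                    "fear": 0, "surprise": 0, "hatred": 0}

    character_data = {
        "dirty": {"anger": +1, "sadness": +1, "joy": -2,
                  "fear": 0, "surprise": +1, "hatred": +3},
    }

    def update_emotion(recognized_words):
        for word in recognized_words:
            for emotion, delta in character_data.get(word, {}).items():
                emotion_data[emotion] += delta

    update_emotion(["you", "are", "dirty"])
    print(emotion_data["hatred"])   # -> 3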

In this way, the electronic pet apparatus 1 changes the emotion of the electronic pet in accordance with a voice input given by the user. In addition, since the electronic pet apparatus 1 creates a response to a voice input in accordance with a result of recognition of the voice input on the basis of the physical-condition data 16B and the emotion data 16C, the response of the electronic pet reflects the physical condition and the emotion of the electronic pet.

To put it in detail, in the electronic pet apparatus 1, a result of voice recognition is supplied to the response-sentence creation module 11E, which creates a response sentence for the result of voice recognition in accordance with rules described in the pattern data 16E as shown in FIG. 8. That is, in the electronic pet apparatus 1, the pattern data 16E describes a response sentence for each key phrase included in the voice input. The response-sentence creation module 11E searches the pattern data 16E for a response sentence associated with the key phrase obtained as a result of voice recognition, outputting the response sentence as a search result.
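
A minimal sketch of this key-phrase search follows, assuming the pattern data 16E can be flattened into a single key-phrase-to-response table; the branching on emotion and physical condition shown in FIG. 8 is omitted here.

    # Search the recognition result for a key phrase and return the response
    # sentence associated with the first matching rule.
    pattern_data = {
        "good day": "Hi",
        "how are you doing?": "I am fine",
    }

    def create_response(recognized_text):
        for key_phrase, response in pattern_data.items():
            if key_phrase in recognized_text.lower():
                return response
        return None

    print(create_response("Good day"))   # -> Hi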

In the electronic pet apparatus 1, an actual response corresponding to the response sentence is generated by the voice synthesis module 11F and output to the speaker 6. Files each containing the voice for a response are shown in FIG. 9. On the other hand, a picture associated with the actual response is created by the picture synthesis module 11G to be displayed on the liquid-crystal display panel 3. Files each containing the picture for a response are shown in FIG. 10. In this way, an actual response to a voice input entered by the user is presented to the user as a voice and a picture.

Since a response is created in the electronic pet apparatus 1 from the pattern data 16E comprising rules for generating different responses in accordance with the physical-condition data 16B and the emotion data 16C, the response to the user reflects the physical condition and the emotion of the electronic pet.

In the processing sequence described above, the electronic pet apparatus 1 is capable of switching the operation to a cataloging operation mode in conformity with a predetermined operation carried out by the user. In this mode, a word and a phrase that can be recognized in a voice recognition process are cataloged into the recognition data 16A.

To put it in detail, when a voice representing a word or the like to be cataloged is received as an input from the user in this cataloging mode, the electronic pet apparatus 1 carries out the same processing as that in the normal operating mode shown in FIGS. 13 to 15 to convert the voice input into a series of phonemes in accordance with the syntax shown in FIG. 14. A voice represented by this series of phonemes is then generated by the voice synthesis module 11F to be confirmed by the user. After the user confirms that the result of voice recognition is correct, the user enters another voice input representing the attribute of the confirmed word or the like.

The electronic pet apparatus 1 also converts the voice input representing the attribute into a series of phonemes. If a voice generated from this series of phonemes is also confirmed by the user, the series of phonemes representing the word or the like entered earlier is cataloged into the recognition data 16A while the data of the attribute is cataloged into the knowledge base 16G, being associated with the word or the like cataloged in the recognition data 16A.

As described above, the electronic pet apparatus 1 is capable of cataloging words and the like entered as a voice input without carrying out difficult operations on an input unit such as a keyboard, allowing the degree of freedom to use the apparatus 1 to be raised commensurately. In addition, it is possible to make the word vocabulary larger to nurture the electronic pet as if the user were actually training a real pet. As a result, the electronic pet can be made familiar and easy to get acquainted with commensurately.

As described above, in a normal voice recognition process, the recognition data 16A is searched for text data corresponding to a series of phonemes obtained as a result of voice conversion and the text data is output as a result of voice recognition used in creation of a response sentence. The text data found in the search may be a word or the like cataloged in the cataloging mode described above. Such text data described by a series of phonemes can also be used in creation of a response sentence in place of text data usually found in the normal voice recognition process. Creation of a response sentence is also based on an attribute recorded in the knowledge base 16G. Thus, when the physical-condition data 16B indicates that the electronic pet is hungry and the input received from the user has a food attribute, for example, the electronic pet apparatus 1 is capable of generating a response stating typically: “I want to eat” or “I want some food.”

As described above, a word and the attribute of the word are received as separate voice inputs and, after the results of voice recognition of the voice inputs are confirmed by the user, the word and the attribute are cataloged. Since the word and its attribute are entered separately and each result of voice recognition is confirmed by the user, it is possible to catalog the word and the attribute with ease and a high degree of reliability.

When the user enters a voice input saying: “Gao”, on the other hand, the electronic pet apparatus 1 carries out the processing represented by the functional block diagram shown in FIG. 16, using the input voice as a keyword for generating a voice based on Rule 1 shown in FIG. 17 in order to request the user to enter information cataloged in advance, such as the name of the user. A voice input entered by the user in response to this request is subjected to a voice recognition process. The voice authentication module 11J employed in the electronic pet apparatus 1 compares a result of the voice recognition process with the authentication data 16K. If the outcome of the comparison authenticates the user, the authentication state 16J is set to indicate that the person entering the voice is the owner.

The response-sentence creation module 11E of the electronic pet apparatus 1 creates a response sentence based on a rule of the pattern data 16E, such as Rule 1 of FIG. 8, which distinguishes a person other than the owner entering a voice input from the owner. To be more specific, the response-sentence creation module 11E refers to the authentication state 16J and creates different responses depending on the value of the authentication state 16J.

Thus, the electronic pet apparatus 1 is capable of responding by displaying a special behavior to the owner as a real pet does, allowing the electronic pet to be made easy to get acquainted with commensurately.

In addition, in the electronic pet apparatus 1, the timer 11B activates the voice authentication module 11J to carry out processing of user authentication at predetermined intervals. In this user authentication processing, the voice authentication module 11J forms a judgment as to whether or not the user is the owner. As shown by the typical conversation of FIG. 19, the judgment is based on a voice input entered by the user in response to an inquiry about a favorite, a hobby or the like of the user which was recorded in the knowledge base 16G as shown by the typical conversation of FIG. 18. In this way, processing to authenticate the user can be carried out.

Thus, the electronic pet apparatus 1 is capable of creating a response showing a special behavior to the owner by verifying the conversation partner to be the user in the course of a conversation when necessary.

In the course of a conversation, the word/phrase classification module 11M employed in the electronic pet apparatus 1 classifies a voice input into the “greeting” or “query” category or the like in the processing shown in FIG. 20 in accordance with the word/phrase classification rule 16M, by referring to a conversation history like the one shown in FIG. 22 or 23 if necessary. In addition, a response to a voice input said by the user is created by following a category restriction described in the pattern data 16E shown in FIG. 21. If a voice input said by the user is classified into the “query” category, for example, the category restriction does not allow a query to be generated as a response even if a rule matching a key phrase in the voice input stipulates that a query be generated as a response to the voice input.

Thus, the electronic pet apparatus 1 is capable of avoiding an unnatural conversation in which a query is returned in response to a query raised by the user.

In addition, the electronic pet apparatus 1 records categories of consecutive words or phrases in a continuous-conversation history 16F. A response to a voice input said by the user is created by referring to the categories recorded in the continuous-conversation history 16F and by considering a category restriction described in the pattern data 16E shown in FIG. 21. As a result, when the user enters a greeting following a greeting said by the electronic pet apparatus 1, the electronic pet apparatus 1 does not generate another greeting in response to the user's greeting, in accordance with the category restriction, even if a greeting-to-greeting rule stipulates that a greeting shall be generated in response to a greeting.

Thus, the electronic pet apparatus 1 is capable of avoiding an unnatural conversation in which greetings are exchanged endlessly, allowing the electronic pet to be made a familiar thing.

In addition, the electronic pet apparatus 1 also carries out the processing shown in the functional block diagram of FIG. 24 to record changes in the variables composing the emotion data 16C, text data obtained as a result of voice recognition of the user's input and the keywords each included in the text data, in the emotion-change history 16N like the one shown in FIG. 25. In the electronic pet apparatus 1, the emotion-change history 16N is searched, at predetermined intervals, for a word used frequently in user inputs in conjunction with a keyword. If such a word is found, the word is cataloged in the character data 16D as a new keyword used for changing the emotion data 16C, as shown in FIG. 26.

For example, assume that, in the electronic pet apparatus 1, the phrase “curry bread” is used in user inputs, as shown in FIG. 25, in conjunction with the keyword “dirty”, which changes the variables of the emotion data 16C, a number of times exceeding a predetermined value. In this case, the emotion changing module 11D catalogs the phrase “curry bread” in the character data 16D as a new keyword as shown in FIG. 26. As shown in FIG. 26, the variables of the emotion data 16C are updated, and a response is generated, by using the same changes as for the keyword “dirty” even when the phrase “curry bread” alone is input.

Thus, the electronic pet apparatus 1 is capable of changing the emotion of the electronic pet by a variety of variations, just as an animal reacts in dependence on conditions and as a human being changes the emotion thereof as a result of an association process. In addition, the electronic pet apparatus 1 is capable of reflecting the variations in emotion in a response generated thereby.

In addition, when the emotion changing module 11D employed in the electronic pet apparatus 1 searches the emotion-change history 16N, the frequency of using each keyword for changing the variables of the emotion data 16C is also checked. If the frequency of using a keyword is found greater than a predetermined value, as shown in FIG. 27, the emotion changing module 11D reduces the absolute values of the variations in the character data 16D for the keyword. In this way, the electronic pet apparatus 1 is capable of forming the so-called sense of accustoming, and the state of accustoming is reflected in the response.

In the electronic pet apparatus 1 used in this way, the user is allowed to operate the operators on the front panel shown in FIG. 2 to connect the apparatus 1 to the network 18 through the network connection unit 17 shown in FIG. 1. With the network connection unit 17 connected to the network 18, the electronic pet apparatus 1 is capable of downloading information such as recognition data 16A, knowledge base data 16G and pattern data 16E from the network 18. As described earlier, the downloaded information comprises effective rules necessary for the voice recognition processing and the response generation processing. The downloaded information is also used to update the recognition data 16A and the knowledge base 16G, allowing the user to enjoy conversations with the electronic pet at a higher level. In addition, it is also possible to download voice data 16H and picture data 16I which can be used as actual response outputs. In this way, expressions of responses can be improved as well.
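
A minimal sketch of such an update over the network follows; the URL and the JSON payload are assumptions, since the text does not specify a transport format, only that rules and data are downloaded and merged.

    # Download an update payload and merge its entries into the local rules.
    import json
    import urllib.request

    def download_updates(url, recognition_data, pattern_data):
        with urllib.request.urlopen(url) as response:
            update = json.load(response)
        recognition_data.update(update.get("recognition_data", {}))
        pattern_data.update(update.get("pattern_data", {}))

    # download_updates("http://example.com/pet-updates", {}, {})  # hypothetical URL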

By the same token, it is also possible to transmit the physical-condition data 16B, the emotion data 16C and the conversation history 16F to a desired apparatus by way of the network 18. In this way, the recipient apparatus is capable of reproducing the electronic pet of the electronic pet apparatus 1, allowing the electronic pet to be taken out to a variety of environments.

Conversely, it is also possible to receive physical-condition data 16B, emotion data 16C and a conversation history 16F from the network 18, allowing the electronic pet apparatus 1 to generate a response as if another electronic pet were visiting the electronic pet apparatus 1 or as if an electronic pet raised in another electronic pet apparatus had been brought out to this electronic pet apparatus 1.
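
A minimal sketch of taking the electronic pet out and bringing it in follows, assuming the transferred state can be serialized as JSON; the file stands in for the network 18 (or for the replaceable IC card mentioned later).

    # Serialize the state that defines the pet, and restore it elsewhere.
    import json

    def export_pet(path, physical_condition, emotion_data, conversation_history):
        state = {"physical_condition": physical_condition,
                 "emotion_data": emotion_data,
                 "conversation_history": conversation_history}
        with open(path, "w") as f:
            json.dump(state, f)

    def import_pet(path):
        with open(path) as f:
            return json.load(f)

    export_pet("pet.json", {"hunger": 4}, {"joy": 2},
               [["user", "greeting", "Hi"]])
    print(import_pet("pet.json")["emotion_data"])   # -> {'joy': 2}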

1-8. Effects of the First Embodiment

According to the configuration described above, the recognition data used as rules of voice recognition, the pattern data used as rules of response generation, the emotion data used as rules of emotion generation, the physical-condition data, the voice data and the picture data can be updated through the network, making it possible to generate almost the same responses as responses generated by another apparatus of the same type connected to the network, as if an electronic pet had been brought out from the other apparatus to this electronic pet apparatus. In addition, the electronic pet apparatus also makes the electronic pet easy to get acquainted with, as a real pet undergoing actual training is. Moreover, the amount of knowledge can be increased by typically increasing the number of words that can be understood by the electronic pet if necessary.

Furthermore, the recognition data can be updated by periodical connection to the network. Thus, the amount of knowledge can be increased without the need for the user to take the trouble to do it.

Conversely, the physical-condition data, the emotion data and a conversation history can be transmitted to another apparatus of the same type by way of the network. In this way, the other apparatus is capable of generating almost the same responses as responses to voice inputs entered to this information processing apparatus such as the electronic pet apparatus, allowing the electronic pet to be treated as if it had been taken out to the other apparatus. As a result, the electronic pet apparatus is capable of making the electronic pet easy to get acquainted with, as a real pet undergoing actual training is.

In addition, data can be updated and transmitted by using an IC card, which is a replaceable recording medium. To be more specific, a new IC card is mounted to update data, and an IC card can be taken to another apparatus to transfer data to the other apparatus. Thus, data can also be exchanged with various kinds of equipment having no communication function.

Moreover, in a cataloging operation mode, a result of voice recognition of a word and the category of the word are cataloged, allowing the size of the vocabulary of words which can be understood by the electronic pet to be increased with ease by voice inputs. As a result, the electronic pet can be treated in the same way as a real pet raised in actual training and can be made easy to get acquainted with.

Furthermore, at that time, on the basis of a series of phonemes obtained as a result of voice recognition, the result of voice recognition of the word and the category of the word are cataloged. Thus, a word and its category can be cataloged by merely entering a voice input without carrying out other operations.

On top of that, a result of voice recognition is output as text data in normal processing and, in a cataloging operation, a description of a series of phonemes is recorded. As a result, the description of data such as rules can be simplified.

In addition, a word and an attribute are treated as inputs distinguished from each other in the cataloging operation. As a result, the cataloging process can be executed with ease.

Furthermore, a result of user authentication based on a voice input is used as a basis for generating different responses for different persons entering voice inputs. Thus, a response of the electronic pet for the owner can be made different from a response for a person other than the owner. As a result, the electronic pet is capable of displaying a behavior as a real pet does and becomes more familiar as well as easier to get acquainted with.

Moreover, by using results of voice recognition obtained in the past, a result of voice recognition obtained this time is examined to authenticate the user. In this way, the user can be authenticated by a conversation without entering a password. As a result, the degree of freedom to use the electronic pet apparatus can be raised.

On top of that, by using results of voice recognition obtained in the past, the user's response to an inquiry obtained this time is examined to authenticate the user, or the user is authenticated by the user's saying of a predetermined word. In this way, the user can be authenticated through a natural conversation. As a result, the degree of freedom to use the electronic pet apparatus can be raised commensurately.

In addition, by identifying the type of a voice input and by generating a response other than a response of a predetermined type or generating a response of a category corresponding to the identified type of the voice input, it is possible to avoid an unnatural conversation like one in which an inquiry is made in response to an inquiry. In this way, a response given by the electronic pet can be made natural as well as lively. As a result, the electronic pet can be made more familiar and easier to get acquainted with.

Furthermore, at that time, generation of a response by referring to a history including the types of inputs and responses can avoid an unnatural conversation like one in which greetings are exchanged repeatedly a number of times. In this way, a response given by the electronic pet can be made natural as well as lively. As a result, the electronic pet can be made more familiar and easier to get acquainted with.

Moreover, variations in emotion parameters can be changed in accordance with a history of results of voice recognition and the corresponding emotion parameters. For a voice heard frequently, for example, it is possible to generate a response full of emotions of intimacy, familiarity and the like. In this way, a response given by the electronic pet can be made natural as well as lively. As a result, the electronic pet can be made more familiar and easier to get acquainted with.

To be more specific, if a word other than a specific word exciting an emotion is used at the same time as the specific word, or as frequently as the specific word is, this frequently used word also comes to change the emotion parameters. That is to say, it is possible to generate a response based on an emotion changed by a word used repeatedly in combination with the specific word.

On top of that, if a specific word among words exciting an emotion is used frequently, variations in the emotion parameters are decreased. As a result, the so-called sense of accustoming can be formed.

2. Other Embodiments

In the embodiment described above, the electronic pet apparatus can be connected to a network to take out the electronic pet from the electronic pet apparatus, to generate a response of an electronic pet raised in another apparatus and to teach the electronic pet nurtured in this electronic pet apparatus a variety of rules and various kinds of information. It should be noted, however, that the scope of the present invention is not limited to this embodiment. For example, only some of the processing described above can be made to be carried out when necessary. In addition, the electronic pet apparatus may make an access to the network periodically, when the user carries out a predetermined operation or when a call is received from another apparatus.

Moreover, according to the embodiment described above, the electronic pet apparatus is connected to a network by a telephone line. It is worth noting, however, that the invention can also be applied to applications wherein the electronic pet apparatus is connected to a network through other equipment such as a modem or a personal computer.

Furthermore, in the embodiment described above, the so-called electronic pet learns recognition data, pattern data, voice data and picture data downloaded from a network. It should be noted, however, that the scope of the present invention is not limited to this embodiment. For example, the electronic pet may also learn only some of the downloaded data as necessary. In addition, the technique to recognize a voice, the technique to generate voice data and the technique to generate picture data themselves can be modified by downloaded control programs describing the techniques. By the same token, the technique to generate emotion data, the processing of the response-sentence creation module and other processing can also be changed.

On top of that, according to the embodiment described above, physical-condition data, emotion data and a conversation history can be transmitted to another apparatus in order to take out the electronic pet thereto. It is worth noting, however, that the scope of the present invention is not limited to such an embodiment. For example, when only some of the data is transmitted, or the data is transmitted along with information such as knowledge, another apparatus may carry out processing to emulate the electronic pet of this electronic pet apparatus. In addition, instead of transmitting such data, a response to an input obtained as a result of voice recognition carried out by another apparatus can be transmitted to the other apparatus.

Furthermore, according to the embodiment described above, various kinds of data can be input from another apparatus in order to bring out the electronic pet of the other apparatus to this electronic pet apparatus. It should be noted, however, that the scope of the present invention is not limited to such an embodiment. For example, when only some of the data is received, or the data is received along with information such as knowledge, processing to emulate the electronic pet of the other apparatus can be carried out. In addition, instead of internally processing such data received from the other equipment, this electronic pet apparatus may transmit a result of voice recognition to the other apparatus and then receive a response to the result of voice recognition from the other apparatus.

Moreover, in the embodiment described above, a voice input is subjected to a voice recognition process to convert the input into a series of phonemes. It is worth noting, however, that the scope of the present invention is not limited to such an embodiment. For example, a variety of voice recognition techniques proper for processing requirements can also be adopted.

On top of that, in the embodiment described above, a word and the attribute of the word are each entered as a voice input to be cataloged in the electronic pet apparatus. It should be noted, however, that the scope of the present invention is not limited to such an embodiment. For example, an attribute can be selected and entered to the electronic pet apparatus by the user by operating an operator. In this case, there is a conceivable technique whereby the user is requested to enter an attribute by selecting an item on a displayed menu.

Furthermore, according to the embodiment described above, for a voice input to be cataloged in the authentication data as text data, a result of voice recognition is output as a series of phonemes representing the voice input, whereas for an ordinary result of voice recognition, ordinary text data is merely produced. It is worth noting, however, that the scope of the present invention is not limited to such an embodiment. For example, also for an ordinary result of voice recognition, the result of voice recognition can be output as a series of phonemes.

Moreover, in the embodiment described above, the user is authenticated by identifying the name or the favorite of the user. It should be noted, however, that the scope of the present invention is not limited to such an embodiment. For example, the present invention can also be applied to a wide range of applications wherein the user is authenticated by verifying a password, a date of a past conversation or a past result of voice recognition.

On top of that, in the embodiment described above, the user is authenticated by checking a special phrase said by the user in response to a predetermined query made by the electronic pet apparatus, and the user is authenticated periodically. It is worth noting, however, that the scope of the present invention is not limited to such an embodiment. For example, the user can also be authenticated either by verification of a special phrase or periodically, as necessary.

Furthermore, according to the embodiment described above, in a process to recognize a voice input by splitting the input into a series of phonemes, the user is authenticated by verifying a generated voice representing a special word. It should be noted, however, that the scope of the present invention is not limited to such an embodiment. For example, the user can also be authenticated by verifying a variety of characteristic quantities representing the characteristics of the user's voice to give the same effect as the embodiment described above. Examples of the characteristic quantities are the tone and the frequency spectrum of the voice.

Moreover, in the embodiment described above, a response of the electronic pet for the owner can be made different from a response for a person other than the owner. It is worth noting, however, that the scope of the present invention is not limited to such an embodiment. For example, more different responses can be generated for more different persons providing voice inputs, such as members of the family of the owner and persons other than family members.

On top of that, in the embodiment described above, an inquiry is prevented from being issued in response to an inquiry in a simple manner based on the type of the inquiry input and the type of the inquiry response. It should be noted, however, that the scope of the present invention is not limited to such an embodiment. For example, an inquiry may be issued in response to an inquiry due to reasons such as the emotion. In this case, the response shows that the electronic pet is in a bad mood.

Furthermore, according to the embodiment described above, the emotion is controlled by manipulation of character data. It is worth noting, however, that the scope of the present invention is not limited to such an embodiment. For example, the emotion data can also be changed directly instead of manipulating the character data.

Moreover, the embodiment described above outputs voice data and picture data. It should be noted, however, that the scope of the present invention is not limited to such an embodiment. For example, voices and pictures may be output as results of audio and video synthesis.

On top of that, in the embodiment described above, the voice recognition processing and the picture synthesis processing are carried out by the central processing unit as shown in FIG. 3. It is worth noting, however, that the scope of the present invention is not limited to such an embodiment. For example, the voice recognition processing and the picture synthesis processing can also be carried out by dedicated circuits as shown in FIG. 28.

Furthermore, the embodiment described above applies the present invention to an electronic pet apparatus outputting a voice and a picture as a response. It should be noted, however, that the scope of the present invention is not limited to such an embodiment. For example, the present invention can also be applied to a robot moving like an animal, an electronic pet apparatus moving and crying to output a response, and an electronic pet apparatus outputting responses in a variety of forms.

Moreover, the embodiment described above applies the present invention to an electronic pet apparatus which is a special-purpose apparatus for emulating an electronic pet, with the front panel thereof shown in FIG. 2. It is worth noting, however, that the scope of the present invention is not limited to such an embodiment. For example, the present invention can also be applied to a variety of portable devices, such as a portable telephone, a portable GPS, a portable tape recorder and a portable optical-disc drive, with a front panel thereof shown in FIG. 28. In addition to such portable devices, the present invention can also be applied to information processing apparatuses such as a personal computer in which a variety of animation characters or the like move.

Industrial Applicability

The present invention can be utilized for an entertainment robot.

What is claimed is:
 1. An electronic pet apparatus characterized in thatsaid apparatus comprises: voice input means for inputting a voice outputby the user; voice recognition means for recognizing a voice receivedfrom said voice input means and for outputting a result of voicerecognition in conformity with a predetermined recognition rule; emotiongeneration means for generating an emotion parameter, which indicates anemotion in a pseudo manner as well as varies at least in accordance witha result of voice recognition and varies with the lapse of time, inconformity with a predetermined emotion-parameter generation rule;response generation means for generating a response to a result of voicerecognition in conformity with a predetermined response generation rulebased on at least said emotion parameter; response output means foroutputting said response; and communication means for carrying outprocessing to update said recognition rule, said emotion-parametergeneration rule and said response generation rule by connection to apredetermined network; or communication means for carrying outprocessing to update data necessary for said recognition rule, saidemotion-parameter generation rule and said response generation rule byconnection to said predetermined network.
 2. The electronic petapparatus according to claim 1, said apparatus characterized in thatsaid communication means periodically connects said electronic petapparatus to said network in order to carry out said update processing.3. The electronic pet apparatus according to claim 1, said apparatuscharacterized in that at least said emotion parameter or data requiredin generation of said emotion parameter can be updated by using datastored in replaceable recording media.
 4. An electronic pet apparatuscharacterized in that said apparatus comprises: voice input means forinputting a voice output by the user; voice recognition means forrecognizing a voice received from said voice input means and foroutputting a result of voice recognition in conformity with apredetermined recognition rule; emotion generation means for generatingan emotion parameter, which indicates an emotion in a pseudo manner aswell as varies at least in accordance with a result of voice recognitionand varies with the lapse of time, in conformity with a predeterminedemotion-parameter generation rule; response generation means forgenerating a response to a result of voice recognition in conformitywith a predetermined response generation rule based on at least saidemotion parameter; response output means for outputting said response;and communication means for carrying out processing to acquire at leastsaid emotion parameter or data necessary for generating said emotionparameter by connection to said predetermined network, wherein saidresponse generation means generates a response depending on said emotionparameter acquired by said communication means or a response dependingon an emotion parameter generated from said data acquired by saidcommunication means.
 5. The electronic pet apparatus according to claim4, said apparatus characterized in that at least said emotion parameteror data required in generation of said emotion parameter can be updatedby using data stored in replaceable recording media.
 6. An electronicpet apparatus characterized in that said apparatus comprises: voiceinput means for inputting a voice output by the user; voice recognitionmeans for recognizing a voice received from said voice input means andfor outputting a result of voice recognition; a data base comprising aresult of voice recognition of a word included in the voice and the typeof said word; response generation means for searching, in accordancewith a result of voice recognition, said data base for the type of aword included in the voice represented by said result of voicerecognition and for generating a response to said result of voicerecognition in dependence of said type; response output means foroutputting said response; and cataloging means capable of changing saiddata base in accordance with the voice representing a word in acataloging operation mode at least by cataloging a result of recognitionof said word into said data base.
 7. The electronic pet apparatusaccording to claim 6, said apparatus characterized in that: said voicerecognition means identifies a voice and outputs a result of voicerecognition as a series of phonemes; and said cataloging means records aresult of voice recognition of a word included in the voice and the typeof said word according to a series of phonemes representing a result ofvoice recognition into said data base.
 8. The electronic pet apparatusaccording to claim 6, said apparatus characterized in that said voicerecognition means outputs a result of voice recognition as text dataobtained as a result of conversion of the voice.
 9. The electronic petapparatus according to claim 6, said apparatus characterized in that, ina cataloging operation mode, said voice recognition means carries out avoice recognition process by delimiting the voice by using predetermineddelimiters.
 10. The electronic pet apparatus according to claim 6, saidapparatus characterized by further having an emotion generation meansfor generating a pseudo emotion parameter, which indicates an emotion ina pseudo manner as well as varies at least in accordance with a resultof voice recognition and varies with the lapse of time, in conformitywith a predetermined pseudo-emotion-parameter generation rule, whereinsaid response generation means generates a response to a result of voicerecognition in conformity with a predetermined response generation ruletaking at least said pseudo emotion parameter as a reference.
11. The electronic pet apparatus according to claim 6, said apparatus characterized by being capable of exchanging at least said pseudo emotion parameter, or data necessary for generation of said pseudo emotion parameter, through replaceable recording media.
12. An electronic pet apparatus characterized in that said apparatus comprises: voice input means for inputting a voice output by the user; voice recognition means for recognizing a voice received from said voice input means and for outputting a result of voice recognition; response generation means for generating a response to a result of voice recognition in conformity with a predetermined response generation rule; response output means for outputting said response; and user authentication means for authenticating the user on the basis of the voice output by said user, wherein said response generation means generates a response to a person entering a voice, with said response varied in dependence on a result of authentication produced by said user authentication means.
13. The electronic pet apparatus according to claim 12, said apparatus characterized in that said user authentication means forms a judgment on a result of voice recognition with a past result of voice recognition used as a reference, and authenticates the user on the basis of a result of said judgment.
14. The electronic pet apparatus according to claim 13, said apparatus characterized in that: said response generation means raises a query about a past result of voice recognition as a response; and said user authentication means forms a judgment on a result of voice recognition of a response to said query in order to authenticate the user.
15. The electronic pet apparatus according to claim 13, said apparatus characterized in that said past result of voice recognition is a predetermined word.
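Claims 12 to 15 describe authentication by a query about a past result of voice recognition; a minimal sketch follows, with the stored secret word and the query text assumed purely for illustration.

    stored_secret = "peach"   # past result of voice recognition (claim 15)

    def authentication_query():
        # Raise a query about the past recognition result (claim 14).
        return "What is your favourite food?"

    def authenticate(recognized_answer):
        # Judge the recognized answer against the past result (claim 13).
        return recognized_answer == stored_secret

    print(authentication_query())
    print(authenticate("peach"))   # True  -> respond as to the owner
    print(authenticate("apple"))   # False -> vary the response (claim 12)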
16. The electronic pet apparatus according to claim 12, said apparatus characterized in that said user authentication means detects a characteristic quantity representing a characteristic of the voice output by the user from a result of voice recognition and authenticates said user on the basis of said characteristic quantity.

17. The electronic pet apparatus according to claim 12, said apparatus characterized by further having an emotion generation means for generating a pseudo emotion parameter, which indicates an emotion in a pseudo manner and varies at least in accordance with a result of voice recognition and with the lapse of time, in conformity with a predetermined pseudo-emotion-parameter generation rule, wherein said response generation means generates a response to a result of voice recognition in conformity with a predetermined response generation rule taking at least said pseudo emotion parameter as a reference.
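The characteristic-quantity authentication of claim 16 could be approximated as follows; a real system would extract spectral features from the voice, whereas the toy feature vector and Euclidean threshold here are assumptions.

    import math

    enrolled_features = [0.42, 1.30, 0.88]   # owner's reference vector, assumed
    THRESHOLD = 0.25                         # assumed decision threshold

    def distance(a, b):
        # Euclidean distance between two characteristic-quantity vectors.
        return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

    def is_owner(features):
        return distance(features, enrolled_features) < THRESHOLD

    print(is_owner([0.40, 1.28, 0.90]))   # True: close to the enrolled voice
    print(is_owner([0.90, 0.10, 0.20]))   # False: treat as another person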
18. An electronic pet apparatus characterized in that said apparatus comprises: voice input means for inputting a voice output by the user; voice recognition means for recognizing a voice received from said voice input means and for outputting a result of voice recognition; response generation means for generating a response to a result of voice recognition in conformity with a predetermined response generation rule; response output means for outputting said response; and word/phrase classification means for identifying the type of an input represented by the voice based on said voice, wherein said response generation rule is a rule of generating responses excluding a response of a predetermined type in accordance with the type of an input and a category of a response to said input.
19. The electronic pet apparatus according to claim 18, said apparatus characterized by further having a history recording means used for storing a history of at least the types of inputs each entered as a voice and the categories of responses to said inputs generated by said response generation means, wherein said response generation means generates a response by referring to said history stored in said history recording means.
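A compact sketch of claims 18 and 19 together: each input type has candidate responses tagged with a category, an assumed exclusion table suppresses responses of a predetermined type, and a history of (input type, response category) pairs is recorded for later reference. All tables and names are hypothetical.

    FORBIDDEN = {("greeting", "question")}   # assumed exclusion rule (claim 18)

    candidates = {                           # input type -> tagged responses
        "greeting": [("Why are you here?", "question"), ("Hello!", "greeting")],
    }

    history = []   # history recording means: (input type, response category)

    def generate(input_type):
        # Generate a response whose category is not excluded for this
        # input type, and record the pair into the history (claim 19).
        for text, category in candidates.get(input_type, []):
            if (input_type, category) not in FORBIDDEN:
                history.append((input_type, category))
                return text
        return "..."

    print(generate("greeting"))   # -> "Hello!"; the question was excluded
    print(history)                # -> [('greeting', 'greeting')]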
20. The electronic pet apparatus according to claim 18, said apparatus characterized by further having an emotion generation means for generating a pseudo emotion parameter, which indicates an emotion in a pseudo manner and varies at least in accordance with a result of voice recognition and with the lapse of time, in conformity with a predetermined pseudo-emotion-parameter generation rule, wherein said response generation means generates a response to a result of voice recognition in conformity with a predetermined response generation rule taking at least said pseudo emotion parameter as a reference.
21. An electronic pet apparatus characterized in that said apparatus comprises: voice input means for inputting a voice output by the user; voice recognition means for recognizing a voice received from said voice input means in conformity with a predetermined recognition rule and for outputting a result of voice recognition; emotion generation means for generating an emotion parameter, which indicates an emotion in a pseudo manner and varies at least in accordance with a result of voice recognition and with the lapse of time, in conformity with a predetermined emotion-parameter generation rule; response generation means for generating a response to a result of voice recognition in conformity with a predetermined response generation rule taking at least said emotion parameter as a reference; and response output means for outputting said response, wherein said emotion generation means has a history recording means used for recording a history of at least a result of voice recognition and said emotion parameter corresponding to said result of voice recognition, and a variation in said emotion parameter according to said result of voice recognition is changed in accordance with said history.
22. The electronic pet apparatus according to claim 21, said apparatus characterized in that: said emotion generation means changes said emotion parameter in accordance with an emotion-exciting word included in an input voice; and when a specific word other than said emotion-exciting word is used together with said emotion-exciting word and as many times as said emotion-exciting word, said specific word also comes to cause said emotion parameter to be changed in the same way as said emotion-exciting word does.
23. The electronic pet apparatus according to claim 21, said apparatus characterized in that: said emotion generation means changes said emotion parameter in accordance with an emotion-exciting word included in an input voice; and when a particular emotion-exciting word is used frequently, a variation in said emotion parameter caused by said particular emotion-exciting word is reduced.
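Claims 21 to 23 describe history-dependent emotion variation; the sketch below, with invented constants and word tables, shows both effects at once: habituation (a frequently used emotion-exciting word causes a smaller variation, claim 23) and assimilation (a word used together with, and as often as, an emotion-exciting word acquires a similar effect, claim 22).

    from collections import Counter

    emotion_words = {"ugly": -0.4}   # assumed emotion-exciting word and delta
    use_count = Counter()            # history: uses of each emotion word
    cooccur = Counter()              # history: (other word, emotion word)

    def emotion_delta(words):
        delta = 0.0
        for w in words:
            if w not in emotion_words:
                continue
            use_count[w] += 1
            # Habituation (claim 23): frequent use reduces the variation.
            delta += emotion_words[w] / use_count[w]
            for other in words:
                if other in emotion_words:
                    continue
                cooccur[(other, w)] += 1
                # Assimilation (claim 22): a word used together with, and
                # as often as, the emotion word also changes the emotion.
                if cooccur[(other, w)] >= use_count[w] >= 3:
                    delta += emotion_words[w] / use_count[w]
        return delta

    for _ in range(4):
        print(emotion_delta(["really", "ugly"]))
        # variations shrink with repetition; from the third repetition
        # onward "really" contributes a matching variation of its own

Both effects are driven purely by the recorded history, mirroring the wherein clause of claim 21 in which the variation in the emotion parameter is changed in accordance with the history.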